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METHODS FOR DETECTING COMPOUNDS WHICH MODULATE THE 
ACTIVITY OF AN LPA RECEPTOR 

TECHNICAL FIELD 

The present invention relates to methods for identifying and characterizing 
compounds that modulate the activity of a member of the EDG family of receptors, including 
EDG-1, EDG-2, EDG-3, EDG-4, and EDG- 5 (gene sequences and amino acid sequences 
represented by SEQ ID NOS: 1, 4, 5, 6 and 7, respectively), and a similar receptor called 
PSP-24 (gene sequence and amino acid sequence represented by SEQ ID NOS: 8 and 9, 
respectively.) 

BACKGROUND 

Cellular signal transduction is a fundamental mechanism whereby external stimuli 
that regulate diverse cellular processes are relayed to the interior of cells. Frequently, binding 
of a ligand to a cell-surface receptor represents the first step in a cascade of events that results 
in a cellular response. The ligands recognized by specific receptors include a diverse array of 
molecules such as peptides, deoxyribonucleotide triphosphates and phospholipids. 

Research into phospholipid signaling is an area of intense scientific investigation, as 
more and more bioactive lipids are being identified and their actions characterized. One 
important addition to the growing list of lipid messengers is lysophosphatidic acid (l-acyl-2- 
hydroxy-M-glycero-3-phosphate, LPA), the simplest of all glycerophospholipids. While 
LPA has long been known as a precursor of phospholipid biosynthesis in both eukaryotic and 
prokaryotic cells, only recently has LPA emerged as an intercellular signaling molecule that 
is rapidly produced and released by activated cells, notably platelets, to influence target cells 
by acting on a specific cell-surface receptor. Moolenaar (1994) Trends Cell Biol 4:213-219. 
Besides being synthesized and processed to more complex phospholipids in the endoplasmic 
reticulum, LPA can be generated through the hydrolysis of pre-existing phospholipids 
following cell activation. The best documented example concerns thrombin-activated 
platelets, where LPA production is followed by its extracellular release. Eichholtz et al. 
(1993) Biochem. J. 291 :677-680. Platelet LPA is formed, at least in large part, through 
phospholipase A 2 (PLA2)-mediated deacylation of newly generated phosphatidic acid (PA). 
Gerrard and Robinson (1989) Biochim. Biophys. Acta 1001:282-285. Distinct PA-specific 
PLA2 activity has been identified in platelets (Ca 2+ -dependent) and in rat brain (Ca 2+ - 
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independent), but little is known about its mode of regulation. Billah et al. (1981) J. Biol 
Chem. 256:5399-5403; and Thompson and Clark (1995) Biochem. J. 306:305-309. 

It remains to be examined at what stage of the platelet activation response LPA is 
produced and how it is released into the extracellular environment. Given the wide variety of 

5 LPA responsive cell types, LPA production and release are unlikely to be restricted to 

platelets. Indeed, there is preliminary evidence that growth factor-stimulated fibroblasts can 
also produce LPA. Fukami and Takenawa (1 992) J. Biol. Chem. 267:10988-10993. 
Furthermore, LPA may be formed and released by injured cells, probably due to nonspecific 
activation of phospholipases. Many other cell systems remain to be examined for LPA 

10 production. 

In freshly prepared mammalian serum, LPA concentrations are estimated to be in 
the range of approximately 2-20 |aM, with oleoyl- and palmitoyl-LPA being the 
predominant species. Tokumura et al. (1994) Am. J. Physiol. 267:C204-C210; and 
Eichholtz et al. (1993) Biochem. J. 291:677-680. LPA is not detectable in platelet-poor 

15 plasma, whole blood, or cerebrospinal fluid. Tigyi and Miledi (1992) J. Biol Chem. 

267:21360-21367. In common with long chain fatty acids, LPA binds with high affinity to 
serum albumin at a molar ratio of about 3 :1. Tigyi et al. (1991) J. Biol Chem. 266:20602- 
20609; Thumser et al. (1994) Biochem. J. 301 :801-806. It is notable that serum albumin 
contains several other, as yet unidentified lipids (methanol-extractable) with LPA-like 

20 biological activity. Tigyi and Miledi (1992) J. Biol Chem. 267:21360-21367. This raises 

the interesting possibility that LPA may belong to a new family of phospholipid mediators 
showing overlapping biological activities and acting on distinct receptors; conceivably, the 
ether-linked phospholipid platelet-activating factor (PAF) and the mitogenic lipid 
sphingosine 1-phosphate may also belong to this putative family. Zhang et al. (1991) J. 

25 Cell Biol 114:155-167. 

The range of biological responses to LPA is quite diverse, ranging from induction of 
cell proliferation to stimulation of neurite retraction and even slimemold chemotaxis, and the 
body of knowledge continues to grow as more and more cellular systems are tested for LPA 
responsiveness. Jalink et al. (1993) Proc. Natl. Acad. Scl U.S.A. 90:1857-1861; Jalink et al. 

30 (1993) Cell Growth and Differ. 4:247-255; and Moolenaar (1995) Curr. Opin. Cell Biol 

7:203-210; Dyer et al. (1992) Molec. Brain Res. 14:293-301; Dyer et al. (1992) Molec. Brain 
Res. 14:302-309; Tigyi and Miledi (1992) J. Biol Chem. 267:21360-21367. 
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Although its precise physiological and pathological functions in vivo remain to be 
explored, LPA derived from platelets has all the hallmarks of an important mediator of 
wound healing and tissue regeneration. Thus, in addition to acting as an autocrine stimulator 
of platelet aggregation, LPA stimulates the growth of fibroblasts, vascular smooth muscle 
cells, endothelial cells, and keratinocytes. Moolenaar (1994) Trends Cell Biol. 4:213-219; 
Jalink et al. (1994) Biochim. Biophys. Acta 1 198:185-196; Van Corven et al. (1989) Cell 
59:45-54; Tigyi et al. (1994) Proc. Natl Acad. Sci. U.S.A. 91:1908-1912; Tokumura et al. 
(1994) Am. J. Physiol 267:C204-C210; and Piazza etal. (1995) Exp. Cell Res. 216:51-64. 
Intriguingly, it has been observed that LPA acts as an inhibitor of eukaryotic DNA 
polymerase a. Murakami-Murofushi et al (1992) J. Biol Chem. 267:21512-21517. LPA 
also exhibits anti-mitogenic activity toward myeloma cells, presumably through a distinct 
receptor subtype. Tigyi et al. (1994) Proc. Natl Acad. Sci. 91:1908-1912; Murakami- 
Murofushi et al. (1993) Cell Structure and Function 18:363-370. 

In addition to stimulating cell growth and proliferation, LPA promotes cellular tension 
and cell-surface fibrohectin binding, which are important events in wound repair and 
regeneration. Zhang et al. (1994) 1 Cell Biol. 127:1447-1459; Kolodney et al. (1993) J. Biol 
Chem. 268:23850-23855; and Lapetina et al. (1981) J. Biochem. 256:5037-5040. As a 
product of the blood-clotting process, LPA is a normal constituent of serum (but not platelet- 
poor plasma), where it is present in an albumin-bound form at physiologically relevant 
concentrations. Tigyi and Miledi (1992) J. Biol. Chem. 267:21360-21367; and Eichholtz et 
al. (1993) Biochem. J. 291:677-680. 

Recently, anti-apoptotic activity has also been ascribed to LPA. PCT Application No. 
PCTAJS94/13649. In this study, an actively proliferating cell line was rescued from serum 
withdrawal-induced apoptosis by LPA. In another study, evidence has been presented 
suggesting that LPA can suppress apoptosis in vitro as well as in ischemic organs such as 
heart and liver. Wu et al. (1996) Transplantation (in press). 

Apoptosis is a normal physiologic process that leads to individual cell death. This 
process of programmed cell death is involved in a variety of normal and pathogenic 
biological events and can be induced by a number of unrelated stimuli. Changes in the 
biological regulation of apoptosis also occur during aging and are responsible for many of the 
conditions and diseases related to aging. Recent studies of apoptosis have implied that a 
common metabolic pathway leading to cell death may be initiated by a wide variety of 
signals, including hormones, serum growth factor deprivation, chemotherapeutic agents, 

3 



WO 99/19513 



PCT/US98/21315 



ionizing radiation, and infection by human immunodeficiency virus (HIV). Wyllie (1980) 
Nature 284:555-556; Kanter et al. (1984) Biochem. Biophys. Res. Commun. 118:392-399; 
Duke and Cohen (1986) Lymphokine Res. 5:289-299; Tomei et al. (1988) Biochem. Biophys. ' 
Res. Commun. 155:324-331; Krumanetal. (1991) J. Cell. Physiol. 148:267-273; Ameisen 
and Capron (1991) Immunol. Today 12:102-105; and Sheppard and Ascher (1992) J. AIDS 
5 : 1 43- 1 47. Agents that affect the biological control of apoptosis thus have therapeutic utility 
in numerous clinical indications. 

Cellular shrinkage, chromatin condensation, cytoplasmic blebbing, increased 
membrane permeability and interchromosomal DNA cleavage characterize Apoptotic cell 
death. Gerschenson et al. (1992) FASEB J. 6:2450-2455; and Cohen and Duke (1992) Ann. 
Rev. Immunol. 10:267-293, The blebs, small, membrane-encapsulated spheres that pinch off 
of the surface of apoptotic cells, may continue to produce superoxide radicals which damage 
surrounding cell tissue and may be involved in inflammatory processes. 

While apoptosis is a normal cellular event, pathological conditions and a variety of 
injuries can also induce it. Apoptosis is involved in a wide variety of conditions, including, 
but not limited to, cardiovascular disease; cancer regression; immune disorders, including, 
but not limited to, systemic lupus erythematosus; viral diseases; anemia; neurological 
disorders; diabetes; hair loss; rejection of organ transplants; prostate hypertrophy; obesity; 
ocular disorders; stress; aging; and gastrointestinal disorders, including, but not limited to, 
diarrhea and dysentery. In the myocardium, apoptotic cell death follows ischemia and 
reperfiision. 

In Alzheimer's disease, Parkinson's disease, Huntington's chorea, epilepsy, 
amyotrophic lateral sclerosis, stroke, ischemic heart disease, spinal cord injury and many 
viral infections, for example, abnormally high levels of cell death occur. In at least some of 
these diseases, there is evidence that the excessive cell death occurs through mechanisms 
consistent with apoptosis. Among these are 1) spinal cord injury, where the severing of 
axons deprives neurons of neurotrophic factors necessary to sustain cellular viability; 2) 
stroke, where after an initial phase of necrotic cell death due to ischemia, the rupture of dead 
cells releases excitatory neurotransmitters such as glutamate and oxygen free radicals that 
stimulate apoptosis in neighboring healthy neurons; and 3) HIV infection, which induces 
apoptosis of T-lymphocytes. 

In contrast, the level of apoptosis is decreased in cancer cells, which allows the cancer 
cells to survive longer than their normal cell counterparts. As a result of the increased 
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number of surviving cancer cells, the mass of a tumor can increase even if the doubling time 
of the cancer cells does not increase. Furthermore, the high level of expression in a cancer 
cell of the bcl-2 gene, which is involved in regulating apoptosis and, in some cases, necrotic 
cell death, renders the cancer cell relatively resistant to chemotherapeutic agents and to 
5 radiation therapy. 

There is considerable evidence of plasma membrane receptors for LPA. LPA-binding 
proteins have been reported in mammalian tissues and labeled using a photoaffinity 
crosslinker derivative. Liliom et al. (1996) Am. J. Physiol 270:C772-C778; Thomson et al. 
(1994) Mol Pharmacol. 45:718-723; and van der Bend et al. (1992) EMBOJ. 1 1:2495-2501. 
10 In laevis oocytes, LPA elicits oscillatory CP currents. Tigyi and Miledi (1992) J. Biol 

Chem. 267:21360-21367. This current, like other effects of LPA, is consistent with a plasma 
membrane receptor-mediated activation of G protein-linked signal transduction pathways. 

G proteins are comprised of three subunits: a guanyl-nucleotide binding a subunit; a P 
subunit; and a y subunit. G proteins cycle between two forms, depending on whether GDP or 
1 5 GTP is bound thereto. When GDP is bound the G a py protein exists as an inactive 

heterotrimer, the G a py complex. When GTP is bound the a subunit dissociates, leaving a Gpy 
complex. Importantly, when a G a py complex operatively associates with an activated G 
protein coupled receptor in a cell membrane, the rate of exchange of GTP for bound GDP is 
increased and, hence, the rate of dissociation of the bound the a subunit from the Gpy 
20 complex increases. This fundamental scheme of events forms the basis for a multiplicity of 

different cell signaling phenomena. 

At least four G protein-mediated signaling pathways have been identified in the action 
of LPA. These are: 1) stimulation of phospholipase C and phospholipase D; 2) inhibition of 
adenylyl cyclase; 3) activation of Ras and the downstream Raf/MAP kinase pathway; and 
25 4) tyrosine phosphorylation of focal adhesion proteins in concert with remodeling of the actin 

cytoskeleton in a Rho-dependent manner. 

GTP-binding proteins fall into two broad classes of regulatory proteins; the 
heterotrimeric G-proteins, and small GTPases. The alpha subunit of heterotrimeric G- 
proteins (Ga) and the small GTPases, as typified by the proto-oncogene Ras, share certain 
30 structural homology, and cycle between an active GTP-bound state and an inactive GDP- 

bound state. When stimulated by an appropriate signal, G-proteins and small GTPases 
become activated by the binding of GTP and physically interact with effector molecules to 
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transduce the signal to the cell. In the case of G-proteins, binding of GTP to the a subunit 

causes the low molecular weight Ga to dissociate from the G$y dimer where either Goc or 

GPy can act as the signal transducer. An intrinsic GTPase activity hydrolyses GTP to GDP 

and thus attenuates the signal. Ancillary proteins collectively known as exchange factors are 

responsible for replacing GDP for GTP and reactivating the GTP-binding protein. 

Heterotrimeric G-protein coupled receptors are a special class of receptors. It is estimated 

that G-protein coupled receptors comprise 0.1% of the human genome (including olfactory 

and visual receptors) which could place the number of different receptors in the thousands. 

The common structural feature of these receptors are seven hydrophilic membrane spanning 

domains. Based on the three dimensional model of bacterial rhodopsin, it is predicted that 

the receptors would form a barrel shaped structure with the ligand binding domains being the 

extracellular loops and/or the transmembrane domains. 

Recently, three putative receptors for LP A have been identified suggesting that 

functionally different LPA receptors may exist that dictate the particular cellular response of 

LPA. Hecht, J. H., et al. (1996) J. Cell. Biol. 135(4), 1071-1083; Macrae, A. D., et al. (1996) 

Mol. Brain. Res. 42, 245-254; An, S., et al. (1997) Biochm. Biophys. Res. Com. 231, 619- 

622; Guo, Z., et al. (1996) Proc. Natl. Acad. Sci. USA 93, 14367-14372; An, et al., J. Biol. 

Chem. (1998). Most cell types respond to LPA making it difficult to characterize the 

receptor dependency of a particular response to LPA since the response cannot be solely 

attributed to a single LPA receptor. In particular, it is difficult to assess ligand binding 

specificity of an LPA receptor without a naive cell line because other lipid receptors may 

exist with overlapping ligand specificity. Therefor, the yeast Saccharomyces cerevisiae was 

used to study the human LPA receptor EDG-2 (or Vzg-1). S. cerevisiae contain no 

endogenous LPA receptors and is therefore a potentially useful organism in which to 

functionally express LPA receptors and analyze their ligand specificity. Other mammalian 

receptors have been functionally expressed in Saccharomyces including the sommatostatin 

receptor. (Price, L. A., et al. (1995) Mol. Cell. Biol. 15(1 1), 6188-6195), the A2a adenosine 

receptor (Price, L. A., et al. (1996) Mol. Pharmacol. 50(4), 829-837) and the p2-adrenergic 

receptor (King, K., et al., (1990) Science 250, 121-123). 

Figure 1 shows a detailed schematic of the yeast pheromone-inducible MAP 

Kinase cascade. Saccharomyces contains a single heterotrimeric G-protein that is activated 

by mating factor binding to a specific receptor. Blumer, K. J., and Thorner, J. (1990) Proc. 

Natl. Acad. Sci. USA 87, 4363-4367. Upon stimulation by an occupied receptor, the a 
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subunit of the heterotrimeric G protein (Ga, the GPA1 gene product (Dietzel, C, and Kurjan, 
J. (1987) Cell 50, 1001-1010; Miyajima, L, et al. (1987) Cell 50, 1011-1019) becomes bound 
to GTP and dissociates from the Py dimer. In yeast, it is the py dimer that transduces the 
signal to Stel 1 (the MEKK equivalent (Lange-Carter, C. A., et al. (1993) Science 260, 315- 
5 319)) and Ste7 (the MEK equivalent (Neiman, A. ML, and Herskowitz, I. (1994) Proa Natl. 

Acad ScL USA 91, 3398-3402)). The active GTP-bound version of Ga is inactivated by 
hydrolysis of GTP to GDP at which time, Ga can re-associate with Gpy and attenuates the 
signal (Blinder, D., and Jenness, D. D. (1989) Mol Cell Biol 9, 3720-3726; Cole, G., (1990) 
Mol Cell Biol 10(510-517); Dietzel, C, and Kurjan, J. (1987) Cell 50, 1001-1010; 
1 0 Miyajima, I., et al. (1987) Cell 50, 101 1-1019). Like the mammalian MAP kinase, the yeast 

MAP kinases Fusl and Kssl activate a transcriptional activator, the STE12 gene product 
(Elion, E. A., et al. (1994) Mol Biol Cell 4, 495-510). Activated Stel 2 in turn activates the 
transcription of several mating-inducible genes such as FUS1 (Elion, E. A., et al. (1991) Cold 
Spring Harbor Symp. Quant. Biol 56, 41-49; Peter, M., et al. (1993) Cell 73, 747-760). To 
1 5 study the EDG-2 receptor using the yeast pheromone response pathway system, a strain 

carrying a mutation in the FAR1 gene was used. This mutation has the effect of uncoupling 
the MAP kinase cascade from cell cycle arrest allowing the yeast to continue growing during 
MAP kinase activation (Peter, M. 9 et al. (1993) Cell 73, 747-760; Peter, M., and Herskowitz, 
1. (1994) Science 265, 1228-1231). Secondly, a mutationally inactivated SST2 gene was 
20 created to increase the sensitivity of the strain to G-protein activation. The SST2 gene 

encodes a GTPase activating protein (GAP) for the Ga subunit (the GPA1 gene product) 
(Dohlman, H. G., et al. (1996) Mol Cell Biol 16(9), 5194-5209). By inactivating the SST2 
gene product, Ga remains in the GTP-bound state longer and thus increases the steady-state 
concentration of the signal transducing py dimer. Finally, to quantify the response, the 
25 bacterial lacZ gene was fused to the mating inducible FUS1 promoter to create a reporter 

gene. 

The ubiquitous presence of the response elicited by LP A in almost every cell line 
tested, combined with the amphiphilic character of LP A that makes radioligand binding 
assays extremely difficult, has presented considerable difficulties in the molecular cloning of 
30 its receptors. In view of the potential physiological significance of LP A receptors in terms of 

wound healing, cell regeneration and cell proliferation and apoptosis, there is a need for drug 
screening assays exhibiting increased specificity that facilitate the search for agonists, inverse 
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agonists, or antagonists of LP A, as well as methods for screening analogues of LP A to 
determine their ability to activate EDG-2, for elucidating the pharmacological properties of 
these proteins. 

The present invention addresses this need. Herein are described methods of screening 
for agonists or antagonists of EDG-1, EDG-2, EDG-3, EDG-4, and EDG-5, as well as 
methods of counter screening for agonists or antagonists that are specific for only one of 
these EDG receptors. 

All references cited herein are incorporated by reference in their entirety. 

SUMMARY OF THE INVENTION 
Methods of screening for pharmaceutical agents that stimulate, as well as 
pharmaceutical agents that inhibit, EDG-1, EDG-2, EDG-3, EDG-4, and EDG-5 activity are 
provided. 

The present invention encompasses a method for identifying compounds which 
modulate the activity of any of the EDG receptors, comprising the steps of: a) contacting 
recombinant host cells, modified to contain the DNA of SEQ. ID. NO. 1, 4, 5, 6, 7 or 8, 
which is operably linked to control sequences for expression, with at least one compound or 
signal whose ability to modulate the activity of the EDG receptor is sought to be determined, 
and b) analyzing the cells for a difference in functional response mediated by said receptor. 
More specifically, the present invention encompasses contacting said cells with at least one 
composition whose ability to modulate the activity of said receptor is sought to be 
determined, and monitoring said cells for a change in the level of a particular signal 
associated with activation of the EDG receptor. EDG receptors encompassed by the present 
invention include EDG-1, EDG-2, EDG-3, EDG-4, and EDG-5. An additional receptor 
encompassed by the present invention is PSP-24, a receptor of LP A discovered in mice, 
which can be used as a screen to evaluate the specificity of a particular ligand for any of the 
EDG family of receptors. For purposes of the present discussion, PSP-24 shall be 
encompassed by the expressions "EDG family of receptors" and "EDG related receptors, 
because it has similarities, including being an LPA receptor. 

Additionally, the present invention contemplates a method for modulating the signal 
transduction activity of the EDG receptor, comprising contacting said receptor with an 
effective amount of at least one compound identified by the method described above. 
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The present invention also encompasses an agonist, antagonist, inverse agonist ,or 
allosteric modulator identified by the above methods. 

In an alternative embodiment, the present invention encompasses a method for 
detecting an agonist, antagonist, inverse agonist, or allosteric modulator of an EDG receptor 

5 having activity comprising the steps of: a) exposing a compound to an EDG receptor 

coupled to a response pathway, under conditions and for a time sufficient to allow interaction 
of the compound with the EDG receptor and an associated response through the pathway, 
and b) detecting an increase or a decrease in the stimulation of the response pathway resulting 
from the interaction of the compound with the EDG receptor, relative to the absence of the 

1 0 tested compound and therefrom determining the presence of an agonist, antagonist, inverse 

agonist, or allosteric modulator. 

In yet another embodiment, the present invention encompasses a method for detecting 
an LP A agonist, antagonist, inverse agonist, or allosteric modulator of LP A receptor 
comprising the steps of a) exposing a compound to the EDG-2 receptor coupled to a response 

1 5 pathway, under conditions and for a time sufficient to allow interaction of the compound with 

the EDG-2 receptor and an associated response through the pathway, and b) detecting an 
increase or a decrease in the stimulation of the response pathway resulting from the 
interaction of the compound with the EDG-2 receptor, relative to the absence of the tested 
compound and therefrom determining the presence of an agonist, antagonist, inverse agonist, 

20 or allosteric modulator. 

In yet another embodiment, the invention encompasses a method for detecting inverse 
agonists of LP A, comprising the steps of a) exposing a compound and LP A to the EDG-2 
receptor coupled to a response pathway, under conditions and for a time sufficient to allow 
interaction of LP A with the EDG-2 receptor and an associated response through the pathway, 

25 and b) detecting an increase or a decrease in the stimulation of the response pathway, relative 

to the absence of the tested compound and therefrom determining the presence of an inverse 
agonist of LPA. 

In yet another embodiment of the present invention, a method of detecting compounds that 
modulate the interaction between a ligand of an EDG related receptor and the EDG related 
30 receptor is encompassed, comprising: exposing a labeled ligand of an EDG related receptor to 

a cell expressing said EDG related receptor; exposing a labeled compound that is believed to 
interact with an EDG related receptor to said cell, and detecting a change in the amount of 
labeled ligand bound to said cell. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 shows a schematic of the yeast pheromone-inducible MAP Kinase cascade. 
Components of this pathway (SST2 and FAR1) that were genetically inactivated by mutation 
are identified by underlines. 

Figure 2 is a graph depicting EDG-2-mediated stimulation of FUSldacZ. Yeast cells 
carrying the galactose-inducible edg-2 gene were grown in SC media containing either 2% 
galactose (filled bars) or 2% glucose (hatched bars) for seven hours in the presence of 
lysophosphatidic acid (LP A) or on galactose in the absence of LPA (open bars). After seven 
hours, the cells were assayed for p-galactosidase (lacZ) activity. 

Figures 3 A and 3B are graphs depicting the stimulation by LPA of FUS l:\lacZ 
activity in cells expressing EDG-2 in a time and dose dependent manner, respectively. 3 A: 
Yeast cells carrying edg-2 (■) or empty vector (♦) were grown in synthetic complete media 
(S.C.) + 2% galactose for the indicated time prior to assaying p-galactosidase activity. 3B, 
Yeast cells carrying edg-2 were grown for seven hours at the indicated dose of LPA. 

Figure 4 is a graph depicting the specific activation by LPA, but not other related 
lysophospholipids or Sphingosine-1 -phosphate (SPP), of FUS l::lacZ. Yeast cells carrying 
edg-2 were grown in S.C. + 2% galactose in the presence of LPA (■), LPC (♦), LPE (•), 
LPG (A), LPS (□) or Sph-l-P (O) at the indicated concentrations for seven hours. All lyso- 
glycerophospholipid were resuspended in BBS/EDTA + 1 mg/ml fatty acid free bovine serum 
albumin to enhance solubility. 

Figure 5 is a graph depicting the specific activation of FUS l:\lacZ by LPA, but not 
Diacyl-glycerophospholipids. Yeast cells were cultured in S.C. + 2% galactose in the 
presence of PA (■), PC (♦), PE (•), PG (A) PS (□) or LPA (O) at the indicated 
concentration for seven hours. All diacyl-glycerophospholipid were resuspended in 
BBS/EDTA + lmg/ml fatty acid free bovine serum albiunin to enhance solubility. 

Figure 6 is a graph depicting the effect of the fatty acid side-chain of LPA on 
activation of FUSlwlacZ. Yeast cells expressing EDG-2 were cultured in S.C. + 2% 
galactose in the presence of 18:1 oleoyl LPA (■), 18:0 steroyl LPA (♦), 16:0 palmitoyl LPA 
(•) or 14:0 (A) at the indicated concentration for seven hours. The numerical representation 
refers to the chain length and degree of saturation. All forms of LPA were resuspended in 
BBS/EDTA + lmg/ml fatty acid free bovine serum albumin to enhance solubility. 
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Figure 7 is a graph depicting the activation of FUSlwlacZ by LP A presented either as 
a liposomal formulation or as freely soluble LPA. Yeast cells carrying EDG-2 were culture 
in S.C. + 2% galactose in the presence of freely soluble LPA (■), LPA + PC liposomes (♦), 
PC alone (O), LPA + PG liposomes (A) or PG alone (□). Note that the concentration of 
5 lipid reflects only the LPA component of the liposome. 

MODES FOR CARRYING OUT THE INVENTION 
The present invention encompasses methods for detecting substances that modulate 
EDG receptor activity. Additionally, the present invention encompasses compositions 
comprising a host organism that does not contain an endogenous LPA receptor and that has 
1 0 been transfected with an EDG receptor. EDG receptors encompassed by the present 

invention include EDG-1, EDG-2, EDG-3, EDG-4, and EDG-5. An additional receptor 
encompassed by the present invention is PSP-24, which can be used as a screen to evaluate 
the specificity of a particular ligand for any of the EDG family of receptors. 

The edg-2 gene product, a lysophosphatidic acid (LPA) receptor, also reported as vzg- 
15 1 , couples to the yeast heterotrimeric G-protein and activates a MAP kinase cascade- 

dependent reporter. The response to LPA can be quantitated by using a reporter gene, 
including, without limitation, the lacZ gene or the luc gene fused to the FUS1 promoter, a 
mating pheromone-inducible gene promoter, the HIS3 gene, or any other gene that can 
genetically compliment an auxotropic growth mutation. The yeast strain used is able to grow 
20 in the presence of activated G-protein due to a mutation in the FAR] gene. This mutation has 

the phenotypic effect of uncoupling G-protein/map kinase activation from cell cycle arrest. 

The following definitions are for the purpose of clarifying the terms used herein, and 
are not meant to be limiting. 

Vectors useful for practicing the present invention include plasmids, viruses 
25 (including phage), and integratable DNA fragments (i.e., fragments integratable into the host 

genome by homologous recombination). The vector may replicate and function 
independently of the host genome, as in the case of a plasmid, or may integrate into the 
genome itself, as in the case of an integratable DNA fragment. Suitable vectors will contain 
replicon and control sequences that are derived from species compatible with the intended 
30 expression host. For example, a promoter operable in a host cell is one which binds the RNA 

polymerase of that cell, and a ribosomal binding site operable in a host cell is one which 
binds the endogenous ribosomes of that cell. 
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DNA regions are "operably" associated when they are functionally related to each 
other. For example: a promoter is operably linked to a coding sequence if it controls the 
transcription of the sequence; a ribosome binding site is operably linked to a coding sequence 
if it is positioned so as to permit translation. 
5 Heterologous DNA sequences are expressed in a host by means of an expression 

vector. An expression vector is a replicable DNA construct in which a DNA sequence 
encoding the heterologous DNA sequence is operably linked to suitable control sequences 
capable of effecting the expression of a protein or protein subunit coded for by the 
heterologous DNA sequence in the intended host. Generally, control sequences include a 
1 0 transcriptional promoter, an optional operator sequence to control transcription, a sequence 

encoding suitable mRNA ribosomal binding sites, and (optionally) sequences which control 
the termination of transcription and translation. 

Transformed host cells of the present invention are cells which have been transformed 
or transfected with the vectors constructed using recombinant DNA techniques and express 
1 5 the protein or protein subunit coded for by the heterologous DNA sequences. 

A variety of yeast cultures, and suitable expression vectors for transforming yeast 
cells, are known. See, e.g., U.S. Pat. No. 4,745,057; U.S. Pat. No. 4,797,359; U.S. Pat. No. 
4,615,974; U.S. Pat. No. 4,880,734; U.S. Pat. No. 4,711,844; and U.S. Pat. No. 4,865,989. 
Saccharomyces cerevisiae is the most commonly used among the yeast, although a number of 
20 other strains are commonly available. See, e.g., U.S. Pat. No. 4,806,472 (Kluveromyces lactis 

and expression vectors therefor); U.S. Pat. No. 4,855,231 (Pichia pastoris and expression 
vectors therefor). Also, any species of Candida can be used. Yeast vectors may contain an 
origin of replication from the 2 micron yeast plasmid or an autonomously replicating 
sequence (ARS), a promoter, DNA encoding the heterologous DNA sequences, sequences for 
25 polyadenylation and transcription termination, and a selection gene. An exemplary plasmid is 

YRp7, (Stinchcomb et al., Nature 282, 39 (1979); Kingsman et aL, Gene 7, 141 (1979); 
Tschemper et al., Gene 10, 157 (1980)). This plasmid contains the trpl gene, which provides 
a selection marker for a mutant strain of yeast lacking the ability to grow in tryptophan, for 
example ATCC No. 44076 or PEP4-1 (Jones, Genetics 85, 12 (1977)). The presence of the 
30 trpl lesion in the yeast host cell genome then provides an effective environment for detecting 

transformation by growth in the absence of tryptophan. 

Suitable promoting sequences in yeast vectors include, but are not limited to, the 
promoters for metallothionein, 3-phosphoglycerate kinase (Hitzeman et al., J. Biol. Chem. 
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255, 2073 (1980) or other glycolytic enzymes (Hess et al., J. Adv. Enzyme Reg. 7, 149 
(1968); and Holland et al, Biochemistry 17, 4900 (1978)), such as enolase, glyceraldehyde- 
3-phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, 
glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate 
5 isomerase, phosphoglucose isomerase, and glucokinase. Suitable vectors and promoters for 

use in yeast expression are further described in R. Hitzeman et al., EPO Publn. No. 73,657. 
Other promoters, which have the additional advantage of transcription controlled by growth 
conditions, are the promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid 
phosphatase, degradative enzymes associated with nitrogen metabolism, and the 
1 0 aforementioned metallothionein and glyceraIdehyde-3 -phosphate dehydrogenase, as well as 

enzymes responsible for maltose and galactose utilization. 

In constructing suitable expression plasmids, the termination sequences associated 
with these genes may also be ligated into the expression vector 3' of the heterologous coding 
sequences to provide polyadenylation and termination of the mRNA. 
15 A "reporter gene" is a gene that is operably linked to control sequences for expression 

of a gene of interest, and that encodes a gene product that can be detected for the purpose of 
evaluating expression of the associated gene of interest. 

The "active state" of a receptor is the state at which the ligand that stimulates the 
receptor can bind to activate a signaling pathway controlled by the receptor. 
20 An "EDG receptor agonist" is defined herein as a composition that is capable of 

combining with the active state of an EDG receptor to up-regulate a signaling pathway 
controlled by an the receptor. An "EDG antagonist" is defined herein as a composition that is 
capable of combining with the EDG receptor in either the active or inactive state, thereby 
impeding the biological action of the receptor. An "inverse agonist" of EDG is defined 
25 herein as a composition that is capable of combining with the inactive state of an EDG 

receptor to down-regulate a signaling pathway controlled by the receptor. 

A composition that "modulates" the activity of an EDG receptor is defined herein as a 
composition that is capable of evoking a change in the functional response mediated by said 
receptor. 

30 As used herein, the terms "purified" or "isolated" are intended to refer to a molecule 

used in the present invention in an enriched or pure form obtainable from a natural source or 
by means of genetic engineering or synthetic chemistry. The purified protein, DNA or RNA 
of the invention may be useful in ways that the protein, DNA and RNA as they naturally 
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occur are not, such as identification of compounds selectively modulating the expression or 
the activity of the EDG-2 of the invention. 

The isolated polypeptide and polypeptide fragments of an EDG means the EDG 
which is free of one or more components of its natural environment. Purified EDG includes 
5 purified EDG in recombinant cell culture. The enriched form of the receptor refers to a 

preparation containing said receptor in a concentration higher than natural, or in a cell where 
it is not found under native conditions e.g., a cellular membrane fraction comprising said 
receptor. If the receptor is in a pure form it is substantially free from other macromolecules, 
particularly from naturally occurring proteinaceous contamination. If desired, the receptor 
1 0 may be solubilized. Preferably, the receptor of the invention is in an active state meaning that 

it has both ligand binding and signal transduction activity. Receptor activity is measured 
according to methods known in the art, e.g., using a binding assay or a functional assay, e.g., 
an assay as described below. 

The invention is further intended to encompass variants of the receptor of the 
1 5 invention. For example, a variant of an EDG receptor of the invention is a functional 

equivalent of said receptor. A functional equivalent is a protein displaying a physiological 
profile essentially identical to the profile characteristic of the particular member of the EDG 
family of receptors having the amino acid sequence set forth in SEQ ID NOS:l , 4, 5, 6, 7 or 
8. The physiological profile in vitro and in vivo includes receptor effector function, 
20 electrophysiological and pharmacological properties, e.g., selective interaction with agonists 

or antagonists. Exemplary functional equivalents may be amino acid mutants including those 
having amino acid deletions, substitutions or insertions, and glycosylation variants. 
Functional equivalents may also include EDG receptors from a different organism. The 
present invention also encompasses methods for comparing the agonist profile of other EDG 
25 related receptors such as EDG-1 (Lee, M.-J., et al. (1996) J. Biol. Chem. 271(19), 1 1272- 

1 1279; Hla, T., and Maciag, T. (1990) J. Biol Chem. 265(16), 9308-9313), EDG-3 
(Yamaguchi, F., et al. (1996) Biochem. Biophys. Res. Comm. 227, 608-614) and H218 
(Okazaki, H., et al. (1993) Biochem. Biophys. Res. Com. 190, 1 104-1 109; MacLennan, A. J., 
et al. (1994) Mol Cell Neurosci. 5, 201-209) as well as the Xenopus high-affinity LPA 
30 receptor, PSP-24 (Guo, Z., et al. (1996) Proc. Natl. Acad. ScL USA 93, 14367-14372). 

Covalent derivatives include, for example, aliphatic esters or amides of a receptor 
carboxyl group, O-acyl derivatives of hydroxyl group containing residues and N-acyl 
derivative of amino group containing residues. Such derivatives can be prepared by linkage 
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of functionalities to reactable groups which are found in the side chains and at the N- and C- 
terminus of the receptor protein. Polypeptides of this invention may be modified post- 
translationally (e.g., acetylation or phosphorylation). 

The invention also encompasses methods wherein an EDG receptor is conjugated to a 
label capable of producing a detectable signal or other functional moieties. Suitable labels 
include, but are not limited to, radionuclides, enzymes, substrates, cofactors, inhibitors, 
fluorescent dyes, chemiluminescent dyes, bioluminescent compounds and magnetic particles. 

A protein for use in the invention is obtainable from a natural source, e.g., by 
isolation^ by chemical synthesis or by recombinant techniques. 

The invention further provides a method for expressing the EDG receptors in host 
cells. Suitable host cells producing the receptor of the invention are multiplied in vitro or in 
vivo. Preferably, the host cells are transformed (transfected) with a hybrid vector comprising 
an expression cassette comprising a promoter and an edg DNA sequence. Subsequent to 
expression of the edg gene under control of the promoter, the specific EDG protein may be 
recovered. Recovery comprises, e.g., isolating the host cells comprising the receptor, e.g., 
from the culture broth. 

The term "functional 5 ' or "biologically active", when used herein as a modifier of an 
EDG, refers to a polypeptide that is able to produce one or more of the functional 
characteristics exhibited by that specific native EDG. In one embodiment, functional means 
capable of binding its specific EDG ligand. In another embodiment, functional means that a 
signal is transduced as a consequence of binding of a ligand. 

Suitable host cells include eukaryotic cells, e.g., animal cells, plant cells and fungi, 
and prokaryotic cells, such as Gram-positive and Gram-negative bacteria, e.g., Escheria coli. 

Isolated polynucleotides (or nucleic acids) encoding a polypeptide substantially 
identical to an EDG protein or portions thereof are designated edg. The term polynucleotide 
as used herein, may be DNA or RNA, either coding or noncoding strands, edg cDNA, 
genomic DNA and synthetic or semi-synthetic DNAs and RNAs. 

The invention includes modifications to edg DNA such as deletions, substitutions and 
additions particularly in the non-coding regions of genomic DNA. Such changes are useful 
to facilitate cloning and modify gene expression in methods of the present invention. 

Various substitutions can be made within the coding region that either do not alter the 
amino acid residues encoded or result in conservatively substituted amino acid residues. 
Nucleotide substitutions that do not alter the amino acid residues encoded are useful for 
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optimizing gene expression in different systems. Suitable substitutions are known to those of 
skill in the art and are made, for instance, to reflect preferred codon usage in the particular 
expression systems. 

The invention encompasses methods using functionally equivalent variants and 
5 derivatives of a particular edg which may enhance, decrease or not significantly affect the 

properties of the resultant EDG. For instance, changes in the DNA sequence that do not 
change the encoded amino acid sequence, as well as those that result in conservative 
substitutions of amino acid residues, one or a few amino acid deletions or additions, and 
substitution of amino acid residues by amino acid analogs are those which will not 
10 significantly affect its properties, such as its ability to bind to LPA or analogs thereof. 

An EDG is preferably incorporated into a vector (a virus, phage, or plasmid) which 
can be introduced by transfection or infection into a cell. The vector preferably includes one 
or more expression control sequences, in which case the cell transfected by the vector is 
capable of expressing the polypeptide. By "isolated DNA" is meant a single- or double- 
1 5 stranded DNA that is free of the genes that, in the naturally occurring genome of the animal 

from which the isolated DNA is derived, flank the edg gene. The term therefore includes, for 
example, either or both strands of an edg cDNA or an allelic variant thereof; a recombinant 
DNA which is incorporated into a vector, into an autonomously replicating plasmid or virus, 
or into the genomic DNA of a prokaryotic or eukaryotic cell; or a genomic DNA fragment 
20 (e.g. , produced by PCR or restriction endonuclease treatment of human or other genomic 

DNA). The term also includes a recombinant DNA that is part of a hybrid gene encoding 
additional polypeptide sequence. 

Stringent conditions for both DNA/DNA and DNA/RNA hybridization assays are as 
described by Sambrook et al, Molecular Cloning, A Laboratory Manual, 2nd Ed., Cold 
25 Spring Harbor Laboratory Press, Cold Spring Harbor, N. Y., 1 989, herein incorporated by 

reference. For example, see page 7.52 of Sambrook et al. 

Given the guidance of the present invention, the nucleic acids used in the invention 
are obtainable according to the methods well known in the art. 

For example, a DNA used in the invention is obtainable by chemical synthesis, by 
30 recombinant DNA technology or by PCR. Preparation by recombinant DNA technology may 

involve screening a suitable cDNA or genomic library. A suitable method for preparing a 
DNA or of the invention may, e.g., comprise the synthesis of a number of oligonucleotides, 
their amplification by PCR methods, and their splicing to give the desired DNA sequence. 
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Suitable libraries are commercially available or can be prepared from individual tissues or 
cell lines. 

For an individual receptor related to an EDG, the expression pattern in different 
tissues may vary. Thus, in order to isolate cDNA encoding a particular EDG-related 
5 receptor, it is advantageous to screen libraries prepared from different suitable tissues or 

cells. As a screening probe, there may be employed a DNA or RNA comprising substantially 
the entire coding region of the edg or a suitable oligonucleotide probe based on said DNA. A 
suitable oligonucleotide probe (for screening involving hybridization) is a single stranded 
DNA or RNA that has a sequence of nucleotides that includes at least 14 contiguous bases 
1 0 that are the same as (or complementary to) any 1 4 or more contiguous bases set forth in SEQ 

ID NO: 1 , 4, 5, 6, 7 or 8, and preferably at least 1 8 contiguous bases that are the same as any 
1 8 or more contiguous bases set forth in SEQ ID NO:l , 4, 5, 6, 7 or 8. The probe may be 
labeled with a suitable chemical moiety for ready detection. The nucleic acid sequences 
selected as probes should be of sufficient length and be sufficiently unambiguous so that false 
1 5 positive results are minimized. 

Preferred regions from which to construct probes include 5' and/or 3' coding 
sequences, sequences predicted to encode ligand binding sites, and the like. For example, 
either the full-length cDNA clone disclosed herein or fragments thereof can be used as 
probes. Preferably, nucleic acid probes of the invention are labeled with suitable label means 
20 for ready detection upon hybridization. For example, a suitable label means is a radiolabel. 

The preferred method of labeling a DNA fragment is by incorporating 32 P-labeled a-dATP 
with the Klenow fragment of DNA polymerase in a random priming reaction, as is well 
known in the art. Oligonucleotides are usually end-labeled with 32 P-labeled y-ATP and 
polynucleotide kinase. However, other methods (e.g., non-radioactive) may also be used to 
25 label the fragment or oligonucleotide, including, e.g., enzyme labeling and biotinylation. 

After screening the library, e.g., with a portion of DNA including substantially the 
entire edg gene or a suitable oligonucleotide based on a portion of said DNA, positive clones 
are identified by detecting a hybridization signal; the identified clones are characterized by 
restriction enzyme mapping and/or DNA sequence analysis, and then examined, e.g., by 
30 comparison with the sequences set forth herein, to ascertain whether they include a full length 

edg gene (i.e., if they include translation initiation and termination codons). If the selected 
clones are incomplete, they may be used to rescreen the same or a different library to obtain 
overlapping clones. If the library is genomic, then the overlapping clones may include exons 
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and introns. If the library is a cDNA library, then the overlapping clones will include an open 
reading frame. In both instances, complete clones can be identified by comparison with the 
DNAs and deduced amino acid sequences provided herein. 

It is envisaged that the nucleic acid of the invention can be readily modified by 
5 nucleotide substitution, nucleotide deletion, nucleotide insertion or inversion of a nucleotide 

stretch, and any combination thereof. Such modified sequences can be used to produce a 
mutant EDG that differs from the receptors found in nature. Mutagenesis may be 
predetermined (site-specific) or random. A mutation that is not a silent mutation should not 
place sequences out of reading frames and preferably will not create complementary regions 
1 0 that could hybridize to produce secondary mRN A structures such as loops or hairpins. 

The edg cDNA or genomic DNA can be incorporated into vectors for transfection of a 
host cell. Furthermore, the invention concerns a recombinant DNA which is a hybrid vector 
comprising at least one of the above mentioned genes. 

The hybrid vectors of the invention comprise an origin of replication or an 
1 5 autonomously replicating sequence, one or more dominant marker sequences and, optionally, 

expression control sequences, signal sequences and additional restriction endonuclease sites. 

Preferably, the hybrid vector of the invention comprises an above described nucleic 
acid insert operably linked to an expression control sequence, in particular those described 
hereinafter. 

20 Vectors typically perform two functions in collaboration with compatible host cells. 

One function is to facilitate the cloning of the edg gene, i.e., to produce useable quantities of 
the nucleic acid (cloning vectors). The other function is to provide for replication and 
expression of the gene constructs in a suitable host, either by maintenance as an 
extrachromosomal element or by integration into the host chromosome (expression vectors). 

25 A cloning vector comprises the DNAs as described above, an origin of replication or an 

autonomously replicating sequence, selectable marker sequences, and optionally, signal 
sequences and additional restriction sites. An expression vector additionally comprises 
expression control sequences essential for the transcription and translation of the edg gene. 
Thus, an expression vector refers to a recombinant DNA construct, such as a plasmid, a 

30 phage, recombinant virus or other vector that, upon introduction into a suitable host cell, 

results in expression of the cloned DNA. Suitable expression vectors are well known in the 
art and include those that are replicable in eukaryotic and/or prokaryotic cells. 
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Most expression vectors are capable of replication in at least one class of organisms 
but can be transfected into another organism for expression. For example, a vector is cloned 
in E. coli and then the same vector is transfected into yeast or mammalian cells even though it 
is not capable of replicating independently of the host cell chromosome. DNA may also be 
amplified by insertion into the host genome. However, the recovery of the genomic edg gene 
is more complex than that of exogenously replicated vector because restriction enzyme 
digestion is required to excise the gene. DNA can be amplified by PCR and directly 
transfected into the host cells without any replication component. 

Advantageously, expression and cloning vectors contain a selection gene also referred 
to as selectable marker. This gene encodes a protein necessary for the survival or growth of 
transformed host cells grown in a selective culture medium. Host cells not transformed with 
the vector containing the selection gene will not survive in the culture medium. Typical 
selection genes encode proteins that confer resistance to antibiotics and other toxins, e.g., 
ampicillin, neomycin, methotrexate or tetracycline, complement auxotrophic deficiencies, or 
supply critical nutrients not available from complex media. 

Suitable selectable markers for mammalian cells are those that enable the 
identification of cells competent to take up the edg gene, such as dihydrofolate reductase 
(DHFR, methotrexate resistance), thymidine kinase, or genes conferring resistance to G418 
or hygromycin. The mammalian cell transfectants are placed under selection pressure in 
which only those transfectants that are uniquely adapted to survive are those which have 
taken up and are expressing the marker. 

Expression and cloning vectors usually contain a promoter that is recognized by the 
host organism and is operably linked to the edg gene. Suitable promoters may be inducible 
or constitutive. The promoters are operably linked to the edg gene by removing the promoter 
from the source DNA by restriction enzyme digestion arid inserting the isolated promoter 
sequence into the vector. Both the native edg promoter sequence and many heterologous 
promoters may be used to direct amplification and/or expression of an edg. However, 
heterologous promoters are preferred, because they generally allow for greater transcription 
and higher yields of EDG as compared to native edg promoter. 

Promoters suitable for use with prokaryotic hosts include, for example, the p- 
lactamase and lactose promoter systems, alkaline phosphatase, a tryptophan (trp) promoter 
system and hybrid promoters such as the tac promoter. Their nucleotide sequences have been 
published thereby enabling the skilled worker to ligate them to the edg gene using linkers or 
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adaptors to supply any required restriction sites. Promoters for use in bacterial systems will 
also generally contain a Shine-Dalgarno sequence operably linked to the edg gene. 

The various DNA segments of the vector DNA are operably linked, /.<?., they are 
contiguous and placed in a functional relationship to each other. Construction of vectors 
5 according to the invention employs conventional ligation techniques. Isolated plasmids or 

DNA fragments are cleaved, tailored and religated in the form desired to generate the 
plasmids required. If desired, analysis to confirm correct sequences in the constructed 
plasmids is performed in a manner known in the art. Suitable methods for constructing 
expression vectors, preparing in vitro transcripts, introducing DNA into host cells, and 
1 0 performing analyses for assessing edg expression and function are known to those skilled in 

the art. Gene presence, amplification and/or expression may be measured in a sample 
directly, for example, by conventional Southern blotting, northern blotting to quantitate the 
transcription of mRNA, dot blotting (DNA or RNA analysis), in situ hybridization, using an 
appropriately labeled probe based on a sequence provided herein, binding assays, 
1 5 immunodetection and functional assays. 

Suitable methods for manipulation of polynucleotides include those described in a 
variety of references, including, but not limited to, Molecular Cloning: A Laboratory 
Manual, 2nd Ed., Vol. 1-3, eds. Sambrook et al. Cold Spring Harbor Laboratory Press 
(1989); and Current Protocols in Molecular Biology, eds. Ausubel et al., Greene Publishing 
20 and Wiley-Interscience: New York (1987) and periodic updates. Those skilled in the art will 

readily envisage how these methods may be modified, if desired. 

The invention further provides host cells capable of producing edg and heterologous 
(foreign) polynucleotides encoding said receptor. The nucleic acids of the invention can be 
expressed in a wide variety of host cells, e.g., those mentioned above, that are transformed or 
25 transfected with an appropriate expression vector. EDG proteins (or a portion thereof) may 

also be expressed as fusion proteins. Recombinant cells can then be cultured under 
conditions whereby the protein(s) encoded by the particular edg is (are) expressed. 

Suitable prokaryotes include eubacteria, such as Gram-negative or Gram-positive 
organisms, such as E. coli, e.g., £. coli K-12 strains, DH5a and HB101, or bacilli. Further 
30 host cells suitable for erfg-encoding vectors include eukaryotic microbes such as filamentous 

fungi or in a preferred embodiment, yeast, e.g., Saccharomyces cerevisiae. 

The advantage of a yeast system is that yeast contain few G-protein coupled receptors 
and it is therefor a simple task to show that the response of the EDG receptor to a particular 
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phospholipid is dependent on the expression of the receptor since it is expressed from a 
galactose inducible promoter. This is in contrast to mammalian cells in which identity and 
distribution of LP A and other glycerophospholipids receptors is unclear. The results show 
that EDG-2 specifically responds to LPA. EDG-2 does not respond to other 
lysophospholipids or to diacyl-glycerophospholipids, in particular phosphatidic acid (PA) or 
to the related lipid messenger sphingosine- 1 -phosphate (SPP). 

Higher eukaryotic cells include insect, amphibian and vertebrate cells, or mammalian 
cells. The methods for expressing proteins of interest in Sf9 cells are known in the art and are 
described in, for example Current Protocols in Molecular Biology, eds. Ausubel et al., 
Greene Publishing and Wiley-Interscience: New York (1987) and references therein. In 
recent years, propagation of vertebrate cells in culture (tissue culture) has become a routine 
procedure. The host cells referred to in this application comprise cells in in vitro culture as 
well as cells that are within a host animal. 

Host cells are transfected or transformed with the above-captioned expression or 
cloning vectors of this invention and cultured in conventional nutrient media modified as 
appropriate for inducing promoters, selecting transformants, or amplifying the genes 
encoding the desired sequences. Heterologous DNA may be introduced into host cells by any 
method known in the art, such as transfection with a vector encoding a heterologous DNA by 
the calcium phosphate coprecipitation technique, by electroporation or by lipofectin-mediated 
transfection. Numerous methods of transfection are known to the skilled worker in the field. 
Successful transfection is generally recognized when any indication of the operation of this 
vector occurs in the host cell. Transformation is achieved using standard techniques 
appropriate to the particular host cells used. 

Incorporation of cloned DNA into a suitable expression vector, transfection of 
eukaryotic cells with a plasmid vector or a combination of plasmid vectors, each encoding 
one or more distinct genes or with linear DNA, and selection of transfected cells are well 
known in the art (see, e.g., Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual, 
Second Edition, Cold Spring Harbor Laboratory Press). 

Transfected or transformed cells are cultured using media and culturing methods 
known in the art, preferably under conditions whereby the particular edg is expressed. 
Suitable culturing media are either commercially available or readily prepared. 

The edg gene is ligated into a vector, and introduced into suitable host cells to 
produce transformed cell lines that express the edg gene. The resulting cell line can then be 
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produced in amounts sufficient for reproducible qualitative and quantitative analysis of the 
effects of a receptor-specific agonist, antagonist or allosteric modulator. The transfected cells 
can then be employed in an drug screening assay provided hereinafter. Such drugs are useful 
in diseases associated with pathogenesis of LP A. Particularly useful for assessing the 
5 specific interaction of compounds with an EDG receptor are stably transfected cell lines 

expressing the EDG. 

Cells expressing EDG polypeptides are useful for identifying substances that bind to a 
specific EDG. Identification of substances that bind to a specific EDG may be achieved by 
assessing the ability of a test compound to inhibit the binding of labeled ligand or analog 

1 0 thereof. Another method for identification of such substances involves assessing the ability 

of a test compound to inhibit specific antibody binding to an EDG. 

Cells expressing EDG polypeptides are also useful for elucidating the signal 
transduction pathways to which EDG is coupled. By "signal transduction pathway" is meant 
the sequence of events that involves the transmission of a message from a cell-surface 

15 receptor to the cytoplasm. The signal will ultimately cause the cell to perform a particular 

function. 

Thus, host cells expressing an EDG are also useful for drug screening and it is an 
object of the present invention to provide a method for identifying a compound or signal 
which modulates the activity of the EDG. The method includes exposing cells containing 

20 heterologous edg, wherein said cells produce functional EDG, to at least one compound or 

signal whose ability to modulate the activity of said EDG is sought to be determined. The 
cells are then monitored for changes caused by the modulation. Such an assay enables the 
identification of agonists, antagonists and allosteric modulators of EDG. 

In a further aspect, the invention relates to an assay for identifying compounds that 

25 modulate the activity of any of the EDG family of receptors. The assay comprises the steps 

of: 

- contacting cells expressing an active EDG and containing heterologous edg with at 
least one compound to be tested for its ability to modulate the activity of said receptor, and 

- analyzing cells for a difference in second messenger level or receptor activity. 

30 Additionally, to determine the specificity of the compound for a particular member of 

the EDG family of receptors, the assay can further comprise the steps of: 

- contacting cells expressing one of the other EDG receptors, or the PSP-24 receptor 
and containing heterologous edg with the same compound tested above, and 
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- analyzing cells for a difference in second messenger level or receptor activity. 

If the compound only effects receptor activity in one EDG family member, it is more 
specific, and may be preferable for certain therapeutic purposes, for example, to reduce the 
likelyhood of undesired biological effects. 

5 

In particular, the invention covers an assay for identifying compounds that modulate 
the activity of EDG' s, said assay comprising: 

- contacting cells expressing an active EDG and containing heterologous edg with at 
least one compound to be tested for its ability to modulate the activity of said receptor, and 

1 o - monitoring said cells for a resulting change in second messenger activity. 

The results obtained in these assays are compared to an assay suitable as a negative 
control. 

Assay methods generally require comparison to various controls. A change in 
receptor activity or in second messenger level is "induced" by a test compound if such an 
1 5 effect does not occur in the absence of the test compound. An effect of a test compound on 

the receptor of the invention is "mediated" by the receptor if this effect is not observed in 
cells that do not express the receptor or express decreased amounts of the receptor. 

As used herein, a compound or signal that modulates the activity of an EDG receptor 
refers to a compound or signal that alters the response pathway mediated by the EDG within 
20 a cell (as compared to the absence or decreased amount of said EDG). A response pathway is 

activated by an extracellular stimulus, resulting in a change in second messenger 
concentration or enzyme activity, or resulting in a change of the activity of a membrane- 
bound protein, such as a receptor or ion channel. A variety of response pathways can be 
utilized, including but not limited to, the adenylate cyclase response pathway, the 
25 phospholipase C/intracellular calcium ion response pathway or a response pathway involving 

activation of Ras or Rho. 

Apoptosis represents another important response pathway that may be modulated by 
EDG agonists or antagonists. Suitable indications for therapeutic use of EDG agonists or 
antagonists that result in modulation of apoptotic pathways include, but are not limited to, 
30 ischemic heart disease, tumors, viral diseases such as HIV infection, neurodegenerative 

disorders, inflammatory bowel disease, hair loss, and rejection of organ transplants. 

Thus EDG expressing cells may be employed for the identification of compounds, 
particularly low molecular weight molecules capable of acting as LPA agonists or 
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antagonists. Within the context of the present invention, an agonist refers to a molecule that 
is capable of interacting with one or more EDO's, thus mimicking the action of LP A. In 
particular, an LPA agonist is characterized by its ability to interact with EDG-2, thereby 
increasing or decreasing the stimulation of a response pathway within a cell. For example, an 
agonist increases or decreases a measurable parameter within the host cell, such as the 
concentration of a second messenger or modulation of apoptosis. 

By contrast, in situations where it is desirable to decrease the activity of an EDG, 
antagonists are useful. Liliom et al. (1996) Molec. Pharmacol. 50:616-623; Bittman et al. 
(1996) J. Lipid Res. 37:391-398. Within the context of the present invention, an antagonist 
refers to a molecule that is capable of interacting with an EDG, but which does not stimulate 
a response pathway within a cell. In particular, LPA antagonists are generally identified by 
their ability to interact with EDG-2, and thereby reduce the ability of the natural ligand to 
stimulate a response pathway within a cell, e.g., by interfering with the binding of LPA to 
EDG-2 or by inhibiting other cellular functions required for the activity of EDG-2. For 
example, in a suitable assay, e.g., an assay involving suitable eukaryotic cells expressing 
EDG-2, a LPA antagonist is capable of modulating the activity of EDG-2 such that the ability 
of the natural ligand to activate the map kinase pathway is reduced. Yet another alternative 
to achieve an antagonistic effect is to rely on overexpression of antisense edg-2 RNA. 
Preferred is an agonist or antagonist selectively acting on EDG-2. 

An allosteric modulator of an EDG interacts with the receptor protein at another site 
than that recognized by any one of its particular ligands, thus acting as agonist or antagonist. 
Therefore, the screening assays described herein are also useful for detecting an allosteric 
modulator of a receptor of the invention. For example, an allosteric modulator acting as 
agonist may enhance the specific interaction between EDG-2 and LPA. For instance, if an 
allosteric modulator acts as an antagonist, it may interact with the receptor protein in such a 
way that binding of the agonist is functionally less effective. Examples include local 
anesthetics such as procaine, lidocaine, dibucaine and tetracaine. 

An in vitro assay for a LPA agonist or antagonist may require that an EDG is 
produced in sufficient amounts in a functional form using recombinant DNA methods. An 
assay is then designed to measure a functional property of the EDG, e.g., interaction with 
LPA. Production of EDG is regarded as occurring in sufficient amounts if activity of the 
receptor results in a measurable response. 
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For example, mammalian cells (available, e.g., from the American Tissue Type 
Culture Collection) are grown in appropriate culture medium. An EDG expression plasmid is 
transiently transfected into the cells, e.g., by calcium-phosphate precipitation. Ausubel, F.M. 
et al. (1993). Cell lines stably expressing the EDG may be generated, e.g., by lipofectin- 
mediated transfection with EDG expression plasmids and a plasmid comprising a selectable 
marker gene. Southern and Berg (1982) J. Mol. Appl. Genet 1:327-341. Cells surviving the 
selection are isolated and grown in the selection medium. Resistant clonal cell lines are 
analyzed, e.g., for immunoreactivity with EDG-specific antibodies or by assays for EDG 
functional responses following agonist addition. Cells producing EDG are used in a method 
for detecting compounds binding to EDG or in a method for identifying an EDG ligand 
agonist or antagonist. 

Compound bound to the target EDG may modulate functional properties of EDG and 
may thereby be identified as an EDG ligand agonist or antagonist in a functional assay. 
Functional assays are used to detect a change in the functional activity of EDO's, for 
instance, as a result of the interaction of the compound to be tested with an EDG. A 
functional response is a change (difference) in the concentration of a relevant second 
messenger influenced by the receptor of the invention within cells expressing a functional 
EDG (as compared to a negative control). Those of skill in the art can readily identify an 
assay suitable for detecting a change in the level of an intracellular second messenger 
indicative of the expression of active EDG-2 (functional assay). Examples include cAMP 
assays (see, e.g., Nakajima et al. (1992) J. Biol. Chem. 247:2437-2442); Tigyi et al. (1996) J. 
Neurochem. 66:549-558) measuring changes in inositol 1,4,5-triphosphate levels (Tigyi et al. 
(1996) J. Neurochem. 66:537-548), measuring CI" ion efflux (Postma et al. (1996) EMBO J. 
15:63-72; Watsky (1995) Am. J. Physiol. 269:C1385-C1393), or measuring changes in 
intracellular Ca 2+ levels (Tigyi et al. (1996) J. Neurochem, 66:537-548). 

More specifically, according to the invention, a method for detecting a LPA agonist 
comprises the steps of (a) exposing a compound to EDG-2 coupled to a response pathway, 
under conditions and for a time sufficient to allow interaction of the compound with EDG-2 
and an associated response through the pathway, and (b) detecting an increase or decrease in 
the stimulation of the response pathway resulting from the interaction of the compound with 
EDG-2, relative to the absence of the tested compound and therefrom determining the 
presence of a LPA agonist. 
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A method for identifying a LPA antagonist comprises the steps of (a) exposing a 
compound in the presence of a known LPA agonist to EDG-2 coupled to a response pathway, 
under conditions and for a time sufficient to allow interaction of the agonist with the receptor 
and an associated response through the pathway, and (b) detecting an inhibition of the 
5 stimulation of the response pathway induced by the agonist, said inhibition resulting from the 

interaction of the compound with EDG-2, relative to the stimulation of the response pathway 
by the LPA agonist alone and determining therefrom the presence of a LPA antagonist. 
Inhibition may be detected if the test compound competes with the LPA agonist for EDG-2. 
Compounds which may be screened utilizing such a method include, but are not limited to, 
10 blocking antibodies specifically binding to EDG-2. Furthermore, such an assay is useful for 

the screening for compounds interacting with LPA. In this case, the agonistic effect is 
neutralized or reduced, e.g., by binding of the test compound to the agonist, thus affecting 
agonist interaction with the receptor. Examples are soluble EDG-2 fragments comprising 
part or all of the ligand binding domain. 
1 5 Preferentially, interaction of an agonist or antagonist with EDG-2 denotes binding of 

the agonist or antagonist to said EDG-2. 

As employed herein, conditions and times sufficient for interaction of an LPA agonist 
or antagonist candidate with the receptor will vary with the source of the receptor, however, 
conditions generally suitable for binding occur between about 4°C and about 40°C, 
20 preferably between about 4°C and about 37°C, in a buffer solution between 0 and 2 M NaCl, 

preferably between 0 and 0.9 M NaCl, with 0. 1 M NaCl being particularly preferred, and 
within a pH range of between 5 and 9, preferably between 6.5 and 8. Sufficient time for the 
binding and response will generally be between about 1 ms and about 24 h after exposure. 
Within one embodiment of the present invention, the response pathway is a 
25 membrane-bound Map Kinase pathway, and, for an agonist, the step of detecting comprises 

measuring a reduction or increase, preferably a reduction, in lacZ production by the 
membrane-bound response pathway, relative to the lacZ production in the relevant control 
setup. For the purpose of the present invention, it is preferred that the reduction or increase 
in lacZ production be equivalent or greater than the reduction or increase induced by LPA 
30 applied at a concentration corresponding to its IC 5 o value. For an antagonist, the step of 

detecting comprises measuring in the presence of the antagonist a smaller LP A-induced 
decrease or increase in lacZ production by the membrane-bound response pathway, as 
compared to the lacZ production in the absence of the antagonist. The measurement of lacZ 
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may be performed after cell destruction or by a lacZ sensitive molecular probe loaded into the 
cell. 

Yeast contains multiple MAP kinase cascades that are functionally analogous to the 

Mitogen Activated Protein Kinase (MAPK) cascade in mammalian systems (Brewster, J. L., 

5 et al. (1993) Science 259, 1760-1763; Irie, K., et al. (1993) Mol Cell Biol 13, 3076-3083; 

Neiman, A. M. s and Herskowitz, I. (1993) Trends Genet 9, 390-394; Chang, E. C, et al. 

(1994) Cell 79, 131-141). A schematic of the S. cerevisiae pheromone response pathway and 

the relevant genetic components are shown in Figure 1. The parental yeast strain, SY2069, 

contains the FUS1 promoter fused to lacZ and HIS3 integrated into different chromosomal 

10 loci and carries the far 1 -bad allele. The FAR J gene product is required for cell cycle arrest 

following exposure to mating pheromone (see Figure 1). By deleting this gene, the cells are 

able to grow in the presence of MAP kinase activation. In addition, a null mutation in the 

SST2 gene was created because it has been previously reported that the Sommatostatin 

receptor can efficiently couple to the endogenous yeast heterotrimeric G-protein after 

15 mutationally inactivating the SST2 gene (Price, L. A., Kajkowski, E. M., Hadock, J. R., 

Ozenberger, B. A., and Pausch, M. H. (1995) Mol Cell Biol 15(1 1), 6188-6195). SST2 

encodes a GTPase activating protein (GAP) for the GPA I gene product, the Ga subunit 

required for mating pheromone signal transduction. The effect of inactivating SST2 is that 

Gpal remains in the GTP-bound state longer and thus permits signaling through the Py dimer 

20 to proceed at a higher rate, resulting in a higher signal from the receptor. 

Further cell-based screening assays can be designed, e.g., by constructing cell lines in 

which the expression of a reporter protein, i.e., an easily assayable protein, such as p~ 

galactosidase, chloramphenicol acetyltransferase (CAT) or luciferase, is dependent on the 

function of an EDG. The resulting DNA construct comprising the enzyme DNA is stably 

25 transfected into a host cell. The host cell is then transfected with a second DNA construct 

containing the edg gene operably linked to additional DNA segments necessary for the 

expression of the receptor. 

Also encompassed by methods of the present invention is the use of the EDG family 

of receptors in an expression vector for radioligand binding assays, such as that described in 

30 Price, L. A. s et al. (1996) Mol. Pharmacol. 50(4), 829-837. Such methods can be used for 

detecting compounds that compete with known EDG binding ligands for binding to the EDG 

related receptors. For example, first labeled 18:1 lysophosphatidic acid ligand is introduced 

to a sample containing yeast cells expressing EDG-2 receptor to allow binding of the 18: 1 
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LPA to the EDG-2 receptor. Next, a composition to be tested for its ability to modulate the 
interaction of 1 8:1 LPA and its receptor is introduced to the yeast cell. Finally, the yeast 
sample is evaluated for an increase or decrease in the amount of labeled agonist bound to the 
yeast cell or membranes purified from the yeast cell. Methods of labeling compounds, and 
labels themselves, are known in the art and include, but are not limited to, radioisotopes, 
enzymes, fluorescent compounds, chemiluminescent compounds, bioluminescent 
compounds, substrate cofactors and inhibitors. See, for examples of patents teaching the use 
of such labels, U.S. Patent Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 
4,275,149; and 4,366,241. By these methods, one can identify the presence of an EDG 
related receptor modulator, such as an antagonist, inverse agonist or agonist. 

The edg gene family is useful for a variety of studies. These include, but are not 
limited to, testing LPA analogs for agonistic/antagonistic activity and for EDG-2 specificity 
when compared to other EDG family members; dissecting the molecular signal transduction 
mechanism, analyzing receptor-ligand interactions by site-directed mutagenesis; determining 
the levels and distribution of the receptors; cloning related receptors; and determining the 
mechanism of tissue-specific responses to LPA and other EDG family ligands. 

Recently, it has been discovered that a lysophospholipid effects Candida adherence in 
vitro. Prakobphol, A., et al. "Plamitoyl carnitine, a lysophospholipase-transacylase inhibitor, 
prevents Candida adherence in vitro," FEMS Microbiol Lett (1997) 151:89-94. By 
identifying a lysophosphatidic acid receptor in Candida, and inserting that receptor into an 
assay of the present invention, one can screen for potential anti-fungal agents that modulate 
the interaction of LPA and such a receptor. 

Methods of the present invention can be used to identify an LPA receptor from a 
heterologous fungal species, such as Candida. For example, one can a) contact recombinant 
host cells, modified to contain a fragment of at least 18 consecutive nucleotides of DNA 
selected from the genomic library of a chosen fungal species, which is operably linked to 
control sequences for expression, with LPA. Next, one can b) analyze the cells for a change 
in functional response to said LPA. Finally, one can repeat steps a) and b) with different 
fragments from the genomic library of the heterologous fungus until a fragment is identified 
that evokes a change in functional response upon treatment with LPA. 

The following examples are provided to illustrate but not limit the invention. 



28 



WO 99/19513 



PCT/US98/21315 



EXPERIMENTAL 

The following abbreviations, used in the experimental section and throughout the 
specification, are herein set forth: LPA (lysophosphatidic acid), LPE 
(lysophosphatidlylethanolamine), LPS (lysophosphatidlyserine), LPC 
(lysophosphatidlycholine), LPG ( lysophosphatidlyglycerol), PA (phosphatidic acid), PE 
(phosphatidylethanolamine), PS (phosphatidylserine), PC (phosphatidylcholine), PG 
(phosphatidylglycerol), Sph-l-P (shingosine-1 -phosphate), BBS (Bicarbonate buffered 
saline). 

EXAMPLE 1 

Construction of the EDG-1, EDG-2, EDG-3, EDG-4 and EDG-5 Expression Plasmids 

and Expression in Yeast 

SY2069 (Mata,farl-bad3, HIS3::pFUSl ::HIS3, mfa2-Al::pFUSl::lacZ, ura3, leu2, 
adel, arg4, trpl) was used to derive an sstZ strain for subsequent studies. SST2 was 
disrupted using pBC14 (Dohlman, H. G., et al. (1996) Mol Cell Biol 16(9), 5194-5209). 
pBC14 was digested with Ncol and transformed into SY2069 by lithium acetate using the 
URA3 gene for selection. Ura + colonies were grown in non-selective media (YEPD) and 
plated onto media containing 5-Flouro-orotic Acid (5-FOA, Sigma). The resultant 5-FOA 
resistant isolates were tested for the supersensitive phenotype by assaying lacZ activity in 
response to a-factor (data not shown). One sst2 strain was named JEY5 and used in all 
subsequent studies. Yeast were grown in SC + 2% Galactose or 2% Glucose media lacking 
Uracil. 

Construction of EDG-2 Expression Plasmid and Expression in Yeast: The EDG-2 
coding region was amplified by RT-PCR using Pfu DNA polymerase under conditions 
described by the supplier (Stratagene). The template for RT-PCR was cDNA (5ng) that was 
reverse transcribed from human fetal brain total RNA (Clontech) using Superscript II 
Reverse Transcriptase as described by the supplier (Gibco BRL). 1 (iM each of the following 
primers, FP; 5 '-GCGATAGGATCC ATC ATGGCTGCC ATCTCTACTTC-3 ' (SEQ ID NO. 

2) and RP; 5'-GCGATACTCGAGCTAAACCACAGAGTGATCATTGC-3' (SEQ ID NO. 

3) , were used for RT-PCR. Oligonucleotide Synthesis and DNA Sequencing: PCR primers 
and DNA sequencing primers were synthesized by the phosphoramidite method with an 
Applied Biosystems model 394 synthesizer, purified by polyacrylamide gel electrophoresis 
and desalted on Sep-Pak Cjg cartridges (Waters Associates, Milford, MA). The edg-2 cDNA 
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was sequenced in p YEUra3 by the dideoxy chain termination method using the T7 
Sequenase7-deaza-dGTP sequencing kit as described by the supplier (Amersham Life 
Science). The primers were designed based on the human edg-2 cDNA sequence submitted 
to Genbank by Zondag and Moolenaar (accession no. Y09479) and included restriction site 
extensions for subcloning into the pYEUra3 vector (Stratagene). This placed the cDNA 
under the control of a galactose-inducible promoter (U ASgal). The resulting plasmid was 
used to transform JEY5 by the lithium acetate method. 

Construction of EDG-1, EDG-3, EDG-4 and EDG-5 Expression Plasmid and 
Expression in Yeast: The other EDG Expression Plasmids were prepared by the same 
method described above for EDG-2, except that the coding regions were amplified by RT- 
PCR using the following primers, respectively: 

EDG-1 :FP; 5'- GCGCGGGATCCACCATGGGGCCCACCAGCGTCCCG-3' (SEQ ID 
NO. 10) 

RP; 5'- GCGCGGTCGACGG AAGA AG AGTTG ACGTTTCC-3 ' (SEQ ID NO. 1 1 ) 

EDG-3 :FP; 5'-GCGCGGGATCCACCATGGCAACTGCCCTCCCG-3' (SEQ ID NO. 12 

) 

RP; 5'-GCGCGGTCGACTCAGTTGCAGAAGATCCCATTC-3' (SEQ ID NO. 13 ) 

EDG-4:FP; 5 ' - ATC AGCGG ATCC ACC ATGGTC ATC ATGGGCC AGT-3 ' (SEQ ID NO. 14 ) 
RP; 5 ' -AGTTC ACTCG AGTC AGTCCTGTTGGTTGGGTTG-3 ' (SEQ ID NO. 15 ) 

EDG-5:FP; 5 ' -GCGCGGG ATCC ACC ATGGGCGGTTTATACTC AG AG-3 5 (SEQ ID NO. 
16) 

RP; 5 ' -GCGCGGTCGACTC AGACC ACTGTGTTGCCC-3 ' (SEQ ID NO. 17). 

Expression of the EDG-2: To test the effects of the SST2 gene product on the edg-2 
response to LPA, JEY5 (sst2A) expressing the EDG-2 receptor was compared to the parental 
SY2069 strain (SST2 + ). Figure 2 shows that the SST2* strain was unresponsive to LPA 
whereas the sst2" derivative was activated by 200jiM LPA. As a control, JEY5+EDG-2 was 
assayed in 2% glucose such that the GAL1 promoter would be repressed and thus not 
expressing the edg-2 gene (glucose repression is described in detail by Johnston, M., and 
Carlson, M. (1992) in The molecular and cellular biology of the yeast Saccharomyces: gene 
expression (Broach, J. R., Pringle, J. R., and Jones, E. W., eds), pp. 193-281, Cold Spring 
Harbor Laborartory Press, Plainview, N.Y.). 
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lacZ assays in response to phospholipids: JEY5 + pJE15 was grown on SC media 
containing either 2% galactose or 2% glucose lacking uracil to an approximate optical density 
of 0.1-0.5 prior to the addition of lipid or a-factor. LPA and other glycerophospholipids 
(Avanti Polar Lipids) were dissolved in chloroform and dried down under vacuum 

5 immediately prior to experiments and resuspended in BBS/EDTA (50mM NH4HCO3, 

104mM NaCl, 250 mM EDTA2Na) at lmg/ml with sonication until the solution was clear. 
Sphingosine-1 -phosphate (Matreya) was resuspended in ethanol/water (9:1) pH 3.0 
immediately prior to use. Fatty Acid Free Bovine Serum Albumin was obtained from Sigma 
(St. Louis, MO) and used at 0. lmg/ml in BBS/EDTA. Cells were allowed to grow for the 

1 0 indicated time (7 hours for dose response experiments) prior to assaying. \00\x\ of yeast 

culture were then added to 900nl assay buffer (per liter: 60 mM Na 2 HP0 4 , 40mM NaH 2 P04> 
lOmM KC1, 0.1 mM MgS0 4 , pH 7.0 plus 2.7ml (3-mercaptoethanol per liter) plus 50[il 
0. 1%SDS + three drops Chloroform. Cells were vortexed for 1 0 seconds and incubated for 5 
minutes at 28°C. 200(iL of 4mg/ml o-nitrophenol-p-D-galactopyranoside (ONPG, Sigma) 

1 5 were added and the reaction was incubated 30 minutes at 28°C. The assay was stopped by 

the addition of 500^1 1 M NazCC^. Color development was measured at A420 and normalized 

to A^oo- Units were expressed as Miller Units. 

Figure 2 shows that the induction of lacZ activity is dependent on 1) the pYEUra3- 
Edg2 plasmid being present in the yeast cell, 2) the yeast cell containing the pYEUra3-Edg2 
20 plasmid being grown on the sugar galactose such that the UASgal promoter which drives the 

expression of the Edg2 gene is induced and 3) lysophosphatidic acid being present. 

To further characterize the LPA response to EDG-2, the time and dose dependency of 
LPA activation was tested. As seen in figure 3 A, LPA results in a time-dependent increase in 
lacZ activity as compared to vector control with a maximal four-fold stimulation of activity at 
25 7 hours. The dose response of EDG-2 to LPA is shown in Figure 3B (E.C.50=20^iM-30nM). 

LPA concentrations above 600 |*M could not be tested due to toxicity. However, the dose 
response curve can been seen to plateau suggesting that maximal activity has been reached. 
This toxicity was seen in other glycerophospholipids tested at 200^M (see below). However, 
the response to LPA was significantly above the vector control suggesting that EDG-2 
30 confers LPA responsiveness upon yeast. LPA resuspended in BBS/EDTA + fatty acid free 

albumin showed the same activity as did freely soluble LPA (data not shown and see below). 
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The results support that the expression of EDG-2 in yeast faithfully reconstitutes 
many of the key properties of an LPA receptor. 

EXAMPLE 2 

EDG-2 responds selectively to LPA and not to other lysophospholipids or to 
corresponding diacyl-glycerophospholipids 
Yeast does not have endogenous receptors for glycerphospholipids such as LPA. 
Therefor, yeast represented an excellent naive system to evaluate the agonist binding 
specificity of EDG-2. Lysophosphatidylethanolamine (LPE), -serine (LPS), -glycerol (LPG) 
and -choline (LPC) and Sphingosine-1 -phosphate (SPP) were tested over the same dose range 
as LPA. 

The results are consistent with EDG-2 being a functional, specific LPA receptor. As 
seen in Figure 4, no other lyso-glycerolphospholipid or Sphingosine-1 -phosphate activated 
EDG-2 as well as did LPA at concentrations up to 200 |iM, the highest concentration tested 
due to toxicity. The results of a similar experiment testing the effects of the diacyl- 
glycerophospholipids are seen in Figure 5. In this experiment, no diacyl-glycerophospholipid 
significantly activated EDG-2 except phosphatidic acid (PA). However, the activity of PA 
may be due to contaminating LPA as determined by HPLC (data not shown). 

EXAMPLE 3 

Acyl-chain length dependency of EDG-2 activity by LPA 
Due to the apparent specificity of EDG-2 for LPA, the dependency of the acyl-chain 
length of the LPA molecule on EDG-2 activation was also determined. Six forms of LPA 
were tested: 24:1, 18:1 (Oleoyl), 18:0 (Stearoyl), 16:0 (Palmitoyl), 14:0 (Myristoyl) and 6:0 
(Caproyl). The experiment was carried out as described in Example 2, above, using the four 
forms of LPA. 

Figure 6 shows that those LPA molecules containing 16 or 18 carbons activated EDG- 
2. This experiment was repeated with the LPA analogs suspended in 0.1 mg/ml fatty acid 
free BSA with similar results (data not shown). 

EXAMPLE 4 

LPA in a liposomal formulation is effective in EDG-2 activation 
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To test the effects of presenting LPA to the yeast cells as a liposome rather than as 
freely soluble LPA or bound to albumin, liposomes were formed with either 
phosphatidylcholine or phosphatidylglycerol. To formulate the liposomes, LPA and dioleoyl- 
phosphatidylcholine or l-palmitoyl-2-oleoyl-phosphatidylglycerol were mixed at a weight 
ratio of 1 :9 (LPA:PC or PG) in Chloroform solution, dried under vacuum (Savant) and 
resuspended in BBS/EDTA to a total lipid concentration of lOmg/ml (LPA concentration = 
lmg/ml). The resultant opaque suspension was sonicated until the solution was clear 
(approximately 10 minutes, Lab Supplies Co., Hicksville, NY). The liposome size ranged 
from 50nm to 80nm as determined on a Coulter N4 Plus Particle Sizer. (Coulter). 

Figure 7 shows that LPA-containing liposomes showed equivalent activity towards 
EDG-2 as did freely soluble LPA. Phosphatidylcholine and phosphatidylglycerol as 
liposome preparations gave no activity on their own (Figure 4). These results support that the 
form in which LPA is presented does not significantly effect its ability to activate EDG-2; 
albumin-bound, freely soluble and liposomal formulations of LPA all activated EDG-2 with 
equal efficacy. 

Although the foregoing invention has been described in some detail by way of 
illustration and example for purposes of clarity and understanding, it will be apparent to 
those skilled in the art that certain changes and modifications may be practiced. Therefore, 
the description and examples should not be construed as limiting the scope of the 
invention, which is delineated by the appended claims. 
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CLAIMS 

We claim: 

1 . A method for identifying compounds that modulate the activity of an EDG 
related receptor, comprising the steps of: 

a) contacting recombinant host cells, modified to contain a fragment of at least 

1 8 consecutive nucleotides of DNA selected from the group comprising SEQ. ID. NOS. 1, 4, 
5, 6, 7 and 8, which is operably linked to control sequences for expression, with at least one 
compound or signal whose ability to modulate the activity of the particular EDG receptor is 
sought to be determined, and 

b) analyzing the cells for a difference in functional response mediated by said 
receptor. 

2. A method according to claim 1 , comprising: 

a) contacting said cells with at least one compound or signal whose ability to 
modulate the activity of said receptor is sought to be determined, and 

b) monitoring said cells for a change in the level of a particular second 
messenger. 

3. A method for modulating the signal transduction activity of an EDG receptor, 
comprising contacting said receptor with an effective amount of at least one compound 
identified by the method of claim 1 . 

4. An agonist, antagonist or allosteric modulator identified by the method of claim 1 . 

5. A method for detecting an LP A agonist or an allosteric modulator of an LP A 
receptor having agonistic activity comprising the steps of: 

a) exposing a compound to the EDG-2 receptor coupled to a response pathway, 
under conditions and for a time sufficient to allow interaction of the compound with the 
EDG-2 receptor and an associated response through the pathway, and 

b) detecting an increase or a decrease in the stimulation of the response pathway 
resulting from the interaction of the compound with the EDG-2 receptor, relative to the 
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absence of the tested compound and therefrom determining the presence of an agonist or an 
allosteric modulator. 

6. A method for detecting an LP A antagonist or an allosteric modulator of LP A 
receptor having antagonistic activity comprising the steps of 

a) exposing a compound to the EDG-2 receptor coupled to a response pathway, 
under conditions and for a time sufficient to allow interaction of the compound with the 
EDG-2 receptor and an associated response through the pathway, and 

b) detecting an increase or a decrease in the stimulation of the response pathway 
resulting from the interaction of the compound with the EDG-2 receptor, relative to the 
absence of the tested compound and therefrom determining the presence of an antagonist or 
an allosteric modulator. 

7. A composition comprising a viable yeast cell transformed with a plasmid 
expressing an EDG related receptor protein selected from the group comprising SEQ. ID NO: 
1 , 4, 5, 6, 7 and 8, a pheromone responsive element and a reporter gene, wherein the reporter 
gene detects the activation of the LP A receptor. 

8. The composition of claim 7, wherein said EDG related receptor protein has an 
amino acid sequence corresponding to SEQENCE ID NO. 1. 

9. The composition of claim 7, wherein said yeast cell is Saccharomyces 
cerevisiae. 

1 0. The composition of claim 7, wherein said reporter gene is lacZ. 

11. A method for identifying compounds which modulate the activity of EDG-2, 
comprising the steps of: 

a) contacting recombinant host cells, modified to contain the DNA of SEQ. ID. NO. 1 
which is operably linked to control sequences for expression, with at least one compound or 
signal whose ability to modulate the activity of the EDG-2 LPA receptor is sought to be 
determined, and 
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b) analyzing the cells for a difference in functional response mediated by said receptor. 

12. A method for modulating the signal transduction activity of the EDG-2 LP A 
receptor, comprising contacting said receptor with an effective amount of at least one 
compound identified by the method of claim 1 1 , 

13. An agonist, antagonist, inverse agonist, or allosteric modulator identified by 
the method of claim 1 1 . 

1 4. A method for detecting an LP A agonist or an allosteric modulator of an LPA 
receptor having agonistic activity comprising the steps of: 

a) exposing a compound to the EDG-2 receptor (SEQ ID NO. 1 ) coupled to a response 
pathway, under conditions and for a time sufficient to allow interaction of the compound with 
the EDG-2 receptor and an associated response through the pathway, and 

b) detecting an increase or a decrease in the stimulation of the response pathway 
resulting from the interaction of the compound with the EDG-2 receptor, relative to the 
absence of the tested compound and therefrom determining the presence of an agonist or an 
allosteric modulator. 

15. A method for detecting an LPA antagonist or an allosteric modulator of LPA 
receptor having antagonistic activity comprising the steps of 

a) exposing a compound to the EDG-2 receptor (SEQ ID NO. 1 ) coupled to a response 
pathway, under conditions and for a time sufficient to allow interaction of the compound with 
the EDG-2 receptor and an associated response through the pathway, and 

b) detecting an increase or a decrease in the stimulation of the response pathway 
resulting from the interaction of the compound with the EDG-2 receptor, relative to the 
absence of the tested compound and therefrom determining the presence of an antagonist or 
an allosteric modulator. 

16. A method for detecting inverse agonists of LPA, comprising the steps of 

a) exposing a compound and LPA to the EDG-2 receptor (SEQ ID NO. 1) coupled to a 
response pathway, under conditions and for a time sufficient to allow interaction of LPA with 
the EDG-2 receptor and an associated response through the pathway; and 
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b) detecting a change in the stimulation of the response pathway, relative to the 
absence of the tested compound and therefrom determining the presence of an inverse agonist 
ofLPA. 

17. A genetically engineered viable yeast cell transformed with plasmids 
expressing an EDG related receptor protein having an amino acid sequence corresponding to 
Seq. ID. NO. 1, 4, 5, 6, 7 or 8, and a reporter gene, wherein the reporter gene detects the 
activation of the EDG related receptor upon the binding of modulators of the EDG related 
receptor. 

18. The yeast cell of claim 17 wherein said yeast cell is selected from the group 
consisting of Saccharomyces cerevisiae and Saccharomyces pombe. 

19. The yeast cell of claim 17, wherein the reporter gene is lac Z. 

20. The yeast cell of claim 17, wherein said EDG related receptor protein has an 
amino acid sequence corresponding to Seq. ID. NO. 1 . 

21. A method for analyzing the specificity of compounds which modulate the 
activity of an EDG related receptor, comprising the steps of: 

a) contacting a first group of recombinant host cells, modified to contain a first 
fragment of at least 18 consecutive nucleotides of DNA selected from the group comprising 
SEQ. ID. NOS. 1, 4, 5, 6, 7 and 8, which is operably linked to control sequences for 
expression, with at least one compound or signal whose ability to modulate the activity of the 
particular EDG receptor is sought to be determined, 

b) contacting a second group of recombinant host cells, modified to contain a second 
fragment of at least 1 8 consecutive nucleotides of a different DNA selected from the group 
comprising SEQ. ID. NOS. 1, 4, 5, 6, 7 and 8, and 

c) analyzing the cells from steps a) and b) for a difference in functional response 
mediated by said first and said second receptors. 

22. A method according to claim 2 1 , comprising: 
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a) contacting said first and said second group of recombinant host cells with at least 
one compound or signal whose ability to modulate the activity of said receptor is sought to be 
determined, and 

b) monitoring said first and said second group of recombinant host cells for a change 
in the level of a particular second messenger. 

23. A method of detecting compounds that modulate the interaction between a 
ligand of an EDG related receptor and the EDG related receptor, comprising: 

a) exposing a labeled ligand of an EDG related receptor to a cell expressing said 
EDG related receptor; 

b) exposing a labeled compound that is believed to interact with an EDG related 
receptor to said cell, and 

c) detecting a change in the amount of labeled ligand bound to said cell. 

24. A method of identifying an LPA receptor from a heterologous fungal species, 
comprising: 

a) contacting recombinant host cells, modified to contain a fragment of at least 
18 consecutive nucleotides of DNA selected from the genomic library of said fungal species, 
which is operably linked to control sequences for expression, with LPA, 

b) analyzing the cells for a change in functional response, and 

c) repeating steps a) and b) with different fragments from said genomic library 
until a fragment is identified that evokes a change in functional response of said cells upon 
treatment with LPA. 
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Figure 3A 
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Figure 4 
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SEQUENCE LISTING 



<110> Erikson, James 

Goddard, J. Graham 

Kiefer, Michael 

LXR Biotechnology, Inc. 

<120> METHODS FOR DETECTING COMPOUNDS WHICH MODULATE THE 
ACTIVITY OF AN LPA RECEPTOR 

<130> 4147-6-PCT 

<140> PCT/US98/21315 
<141> 1998-10-09 

<160> 19 

<170> Patentln Ver. 2.0 

<210> 1 

<211> 1110 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (10) . . (1101) 

<400> 1 

ggatccatc atg get gec ate tct act tec ate cct gta att tea cag ccc 51 

Met Ala Ala lie Ser Thr Ser lie Pro Val lie Ser Gin Pro 
15 10 

cag ttc aca gee atg aat gaa cca cag tgc ttc tac aac gag tec att 99 
Gin Phe Thr Ala Met Asn Glu Pro Gin Cys Phe Tyr Asn Glu Ser lie 
15 20 25 30 

gee ttc ttt tat aac cga agt gga aag cat ctt gee aca gaa tgg aac 147 
Ala Phe Phe Tyr Asn Arg Ser Gly Lys His Leu Ala Thr Glu Trp Asn 
35 40 45 

aca gtc age aag ctg gtg atg gga ctt gga ate act gtt tgt ate ttc 195 
Thr Val Ser Lys Leu Val Met Gly Leu Gly lie Thr Val Cys lie Phe 
50 55 60 

ate atg ttg gee aac eta ttg gtc atg gtg gca ate tat gtc aac cgc 243 
lie Met Leu Ala Asn Leu Leu Val Met Val Ala lie Tyr Val Asn Arg 
65 70 75 

cgc ttc cat ttt cct att tat tac eta atg get aat ctg get get gca 291 
Arg Phe His Phe Pro lie Tyr Tyr Leu Met Ala Asn Leu Ala Ala Ala 
80 85 90 

gac ttc ttt get ggg ttg gee tac ttc tat etc atg ttc aac aca gga 339 
Asp Phe Phe Ala Gly Leu Ala Tyr Phe Tyr Leu Met Phe Asn Thr Gly 
95 100 105 110 

ccc aat act egg aga ctg act gtc age aca tgg etc ctt cgt cag ggc 387 
Pro Asn Thr Arg Arg Leu Thr Val Ser Thr Trp Leu Leu Arg Gin Gly 
115 120 125 
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etc att gac acc age ctg acg gca tct gtg gee aac tta ctg get att 435 
Leu lie Asp Thr Ser Leu Thr Ala Ser Val Ala Asn Leu Leu Ala lie 
130 135 140 

gca ate gag agg cac att acg gtt ttc cgc atg cag etc cac aca egg 483 
Ala He Glu Arg His He Thr Val Phe Arg Met Gin Leu His Thr Arg 
145 150 155 

atg age aac egg egg gta gtg gtg gtc att gtg gtc ate tgg act atg 531 
Met Ser Asn Arg Arg Val Val Val Val He Val Val He Trp Thr Met 
160 165 170 

gee ate gtt atg ggt get ata ccc agt gtg ggc tgg aac tgt ate tgt 57 9 
Ala He Val Met Gly Ala He Pro Ser Val Gly Trp Asn Cys He Cys 
175 180 185 190 

gat att gaa aat tgt tec aac atg gca ccc etc tac agt gac tct tac 627 
Asp He Glu Asn Cys Ser Asn Met Ala Pro Leu Tyr Ser Asp Ser Tyr 
195 200 205 

tta gtc ttc tgg gec att ttc aac ttg gtg acc ttt gtg gta atg gtg 675 
Leu Val Phe Trp Ala He Phe Asn Leu Val Thr Phe Val Val Met Val 
210 215 220 

gtt etc tat get cac ate ttt ggc tat gtt cgc cag agg act atg aga 723 
Val Leu Tyr Ala His He Phe Gly Tyr Val Arg Gin Arg Thr Met Arg 
225 230 235 

atg tct egg cat agt tct gga ccc egg egg aat egg gat acc atg atg 771 
Met Ser Arg His Ser Ser Gly Pro Arg Arg Asn Arg Asp Thr Met Met 
240 245 250 

agt ctt ctg aag act gtg gtc att gtg ctt ggg gec ttt ate ate tgc 819 
Ser Leu Leu Lys Thr Val Val He Val Leu Gly Ala Phe He He Cys 
255 260 265 270 

tgg act cct gga ttg gtt ttg tta ctt eta gac gtg tgc tgt cca cag 867 
Trp Thr Pro Gly Leu Val Leu Leu Leu Leu Asp Val Cys Cys Pro Gin 
275 280 285 

tgc gac gtg ctg gee tat gag aaa ttc ttc ctt etc ctt get. gaa ttc 915 
Cys Asp Val Leu Ala Tyr Glu Lys Phe Phe Leu Leu Leu Ala Glu Phe 
290 295 300 

aac tct gec atg aac ccc ate att tac tec tac cgc gac aaa gaa atg 963 
Asn Ser Ala Met Asn Pro He He Tyr Ser Tyr Arg Asp Lys Glu Met 
305 310 315 

age gec acc ttt agg cag ate etc tgc tgc cag cgc agt gag aac ccc 1011 
Ser Ala Thr Phe Arg Gin He Leu Cys Cys Gin Arg Ser Glu Asn Pro 
320 325 330 

acc ggc ccc aca gaa ggc tea gac cgc teg get tec tec etc aac cac 1059 
Thr Gly Pro Thr Glu Gly Ser Asp Arg Ser Ala Ser Ser Leu Asn His 
335 340 345 350 

acc ate ttg get gga gtt cac age aat gat cac tct gtg gtt tagctcgag 1110 
Thr He Leu Ala Gly Val His Ser Asn Asp His Ser Val Val 
355 360 
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<210> 2 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 2 

gcgataggat ccatcatggc tgccatctct acttc 35 



<210> 3 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 3 

gcgatactcg agctaaacca cagagtgatc attgc 35 



<210> 4 

<211> 1149 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (1146) 

<400> 4 

atg gtg tec act age ate ccg gag gtt aaa get etc cgc age tea gtc 48 
Met Val Ser Thr Ser lie Pro Glu Val Lys Ala Leu Arg Ser Ser Val 
1 5 10 15 

tct gac tat ggg aac tat gat ate ata gtc egg cat tac aac tac aca 96 
Ser Asp Tyr Gly Asn Tyr Asp lie lie Val Arg His Tyr Asn Tyr Thr 
20 25 30 

ggc aag ttg aac ate ggg gcg gag aag gac cat ggc att aaa ctg act 144 
Gly Lys Leu Asn lie Gly Ala Glu Lys Asp His Gly lie Lys Leu Thr 
35 40 45 

tea gtg gtg ttc att etc ate tgc tgc ttc ate ate eta gag aat ata 192 
Ser Val Val Phe He Leu He Cys Cys Phe He He Leu Glu Asn He 
50 55 60 

ttt gtc ttg eta act att tgg aaa acc aag aag ttc cac egg ccc atg 240 
Phe Val Leu Leu Thr He Trp Lys Thr Lys Lys Phe His Arg Pro Met 
65 70 75 80 

tac tat ttc ata ggc aac eta gee etc teg gac eta tta gca ggc gtg 288 
Tyr Tyr Phe He Gly Asn Leu Ala Leu Ser Asp Leu Leu Ala Gly Val 
85 90 95 

get tac aca get aac ctg ctg ttg tct ggg gec acc act tac aag etc 336 
Ala Tyr Thr Ala Asn Leu Leu Leu Ser Gly Ala Thr Thr Tyr Lys Leu 
100 105 110 
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aca cct gcc cag tgg ttt ctg egg gaa ggg agt atg ttt gtg get etc 384 
Thr Pro Ala Gin Trp Phe Leu Arg Glu Gly Ser Met Phe Val Ala Leu 
115 120 125 

tct gca tea gtc ttc age etc ctt gcc ate gcc att gag cgc tac ate 432 
Ser Ala Ser Val Phe Ser Leu Leu Ala lie Ala lie Glu Arg Tyr lie 
130 135 140 

ace atg ctg aag atg aaa eta cac aac ggg age aac age teg cgc tec 480 
Thr Met Leu Lys Met Lys Leu His Asn Gly Ser Asn Ser Ser Arg Ser 
145 150 155 160 

ttt ctg ctg ate age gcc tgc tgg gtc ate tec etc ate ctg ggg ggc 528 
Phe Leu Leu lie Ser Ala Cys Trp Val lie Ser Leu lie Leu Gly Gly 
165 170 175 

ctg ccc age atg ggc tgg aac tgc ate age teg ctg tct age tgc tec 576 
Leu Pro Ser Met Gly Trp Asn Cys lie Ser Ser Leu Ser Ser Cys Ser 
180 185 f 190 

ace gtg etc ccg etc tac cac aag cac tat att etc ttc tgc acc ace 624 
Thr Val Leu Pro Leu Tyr His Lys His Tyr He Leu Phe Cys Thr Thr 
195 200 205 

gtc ttc act ctg etc ctg ctt tec ate gcc ate etc tac tgc agg ate 672 
Val Phe Thr Leu Leu Leu Leu Ser He Ala He Leu Tyr Cys Arg He 
210 215 220 

tac tec ttg gtc agg act cga age cgc cgc ctg acc ttc cgc aag aac 720 
Tyr Ser Leu Val Arg Thr Arg Ser Arg Arg Leu Thr Phe Arg Lys Asn 
225 230 235 240 

ate tec aag ggc agt cgc agt tct gag aag tct ctg gcc ttg ttg aag 768 
He Ser Lys Gly Ser Arg Ser Ser Glu Lys Ser Leu Ala Leu Leu Lys 
245 250 255 

acg gtg ate att gtc ttg agt gtc ttc att gcc tgc tgg gcc cct etc 816 
Thr Val He He Val Leu Ser Val Phe He Ala Cys Trp Ala Pro Leu 
260 265 270 

ttc ate eta eta ctg tta gat gtg ggc tgc aag gcg aag acc tgt gac 864 
Phe He Leu Leu Leu Leu Asp Val Gly Cys Lys Ala Lys Thr Cys Asp 
275 280 285 

ate ctg tac aaa gca gag tac ttc ctg gtt ctg get gtg ctg aac tea 912 
He Leu Tyr Lys Ala Glu Tyr Phe Leu Val Leu Ala Val Leu Asn Ser 
290 295 300 

ggt acc aac ccc ate ate tac act ctg acc aac aag gag atg cgc egg 960 
Gly Thr Asn Pro He He Tyr Thr Leu Thr Asn Lys Glu Met Arg Arg 
305 310 315 320 

gcc ttc ate egg ate gta tct tgt tgc aaa tgc ccc aac gga gac tct 1008 
Ala Phe He Arg He Val Ser Cys Cys Lys Cys Pro Asn Gly Asp Ser 
325 330 335 

get ggc aaa ttc aag agg ccc ate ate cca ggc atg gaa ttt age cgc 1056 
Ala Gly Lys Phe Lys Arg Pro He He Pro Gly Met Glu Phe Ser Arg 
340 345 350 

age aaa tea gac aac tec tct cac ccc cag aag gac gat ggg gac aac 1104 
Ser Lys Ser Asp Asn Ser Ser His Pro Gin Lys Asp Asp Gly Asp Asn 
355 360 365 
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cca gag acc att atg teg tct gga aac gtc aat tct tct tec taa 1149 
Pro Glu Thr lie Met Ser Ser Gly Asn Val Asn Ser Ser Ser 
370 375 380 



<210> 5 

<211> 1137 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (1137) 

<400> 5 

atg gca act gec etc ccg ccg cgt etc cag ccg gtg egg ggg aac gag 48 

Met Ala Thr Ala Leu Pro Pro Arg Leu Gin Pro Val Arg Gly Asn Glu 
15 10 15 

acc ctg egg gag cat tac cag tac gtg ggg aag ttg gcg' ggc agg ctg 96 
Thr Leu Arg Glu His Tyr Gin Tyr Val Gly Lys Leu Ala Gly Arg Leu 
20 25 30 

aag gag gec tec gag ggc age acg etc acc acc gtg etc ttc ttg gtc 144 
Lys Glu Ala Ser Glu Gly Ser Thr Leu Thr Thr Val Leu Phe Leu Val 
35 40 45 

ate tgc age ttc ate gtc ttg gag aac ctg atg gtt ttg att gee ate 192 
lie Cys Ser Phe lie Val Leu Glu Asn Leu Met Val Leu He Ala He 
50 55 60 

tgg aaa aac aat aaa ttt cac aac cgc atg tac ttt ttc att ggc aac 240 
Trp Lys Asn Asn Lys Phe His Asn Arg Met Tyr Phe Phe He Gly Asn 
65 70 75 80 

ctg get etc tgc gac ctg ctg gee ggc ate get tac aag gtc aac att 288 
Leu Ala Leu Cys Asp Leu Leu Ala Gly He Ala Tyr Lys Val Asn He 
85 90 95 

ctg atg tct ggc aag aag acg ttc age ctg tct ccc acg gtc tgg ttc 336 
Leu Met Ser Gly Lys Lys Thr Phe Ser Leu Ser Pro Thr Val Trp Phe 
100 105 110 

etc agg gag ggc agt atg ttc gtg gec ctt ggg gcg tec acc tgc age 384 
Leu Arg Glu Gly Ser Met Phe Val Ala Leu Gly Ala Ser Thr Cys Ser 
115 120 125 

tta ctg gec ate gec ate gag egg cac ttg aca atg ate aaa atg agg 432 
Leu Leu Ala He Ala lie Glu Arg His Leu Thr Met He Lys Met Arg 
130 135 140 

cct tac gac gee aac aag agg cac cgc gtc ttc etc ctg ate ggg atg 480 
Pro Tyr Asp Ala Asn Lys Arg His Arg Val Phe Leu Leu He Gly Met 
145 150 155 160 

tgc tgg etc att gec ttc acg ctg ggc gee ctg ccc att ctg ggc tgg 528 
Cys Trp Leu lie Ala Phe Thr Leu Gly Ala Leu Pro lie Leu Gly Trp 
165 170 175 

aac tgc ctg cac aat etc cct gac tgc tct acc ate ctg ccc etc tac 576 
Asn Cys Leu His Asn Leu Pro Asp Cys Ser Thr He Leu Pro Leu Tyr 
180 185 190 
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tec aag aag tac att gec ttc tgc ate age ate ttc acg gee ate ctg 624 
Ser Lys Lys Tyr lie Ala Phe Cys lie Ser lie Phe Thr Ala lie Leu 
195 200 205 

gtg acc ate gtg ate etc tac gca cgc ate tac ttc ctg gtg aag tec 672 
Val Thr lie Val lie Leu Tyr Ala Arg lie Tyr Phe Leu Val Lys Ser 
210 215 220 

age age cgt aag gtg gee aac cac aac aac teg gag egg tec atg gca 720 
Ser Ser Arg Lys Val Ala Asn His Asn Asn Ser Glu Arg Ser Met Ala 
225 230 235 240 

ctg ctg egg acc gtg gtg att gtg gtg age gtg ttc ate gee tgc tgg 768 
Leu Leu Arg Thr Val Val lie Val Val Ser Val Phe lie Ala Cys Trp 
245 250 255 

tec cca etc ttc ate etc ttc etc att gat gtg gee tgc agg gtg cag 816 
Ser Pro Leu Phe lie Leu Phe Leu lie Asp Val Ala Cys Arg Val Gin 
260 265 270 

gcg tgc ccc ate etc ttc aag get cag tgg ttc ate gtg ttg get gtg 864 
Ala Cys Pro lie Leu Phe Lys Ala Gin Trp Phe lie Val Leu Ala Val 
275 280 285 

etc aac tec gee atg aac ccg gtc ate tac acg ctg gec age aag gag 912 
Leu Asn Ser Ala Met Asn Pro Val lie Tyr Thr Leu Ala Ser Lys Glu 
290 295 300 

atg egg egg gec ttc ttc cgt ctg gtc tgc aac tgc ctg gtc agg gga 960 
Met Arg Arg Ala Phe Phe Arg Leu Val Cys Asn Cys Leu Val Arg Gly 
305 310 315 320 

c 99 999 9 CC C 9 C 9 CC tca c cc ate cag cct gcg etc gac cca age aga 1008 
Arg Gly Ala Arg Ala Ser Pro lie Gin Pro Ala Leu Asp Pro Ser Arg 
325 330 335 

agt aaa tca age age age aac aat age age cac tct ccg aag gtc aag 1056 
Ser Lys Ser Ser Ser Ser Asn Asn Ser Ser His Ser Pro Lys Val Lys 
340 345 350 

gaa gac ctg ccc cac aca gac ccc tca tec tgc ate atg gac aag aac 1104 
Glu Asp Leu Pro His Thr Asp Pro Ser Ser Cys lie Met Asp Lys Asn 
355 360 365 



gca gca ctt cag aat ggg ate ttc tgc aac tga 
Ala Ala Leu Gin Asn Gly lie Phe Cys Asn 
370 375 



1137 



<210> 6 

<211> 1734 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (85) . . (1230) 

<400> 6 

ggcacgaggc gccgggccat gggectcgag cccgccccga acccccgcga gcccgccttg 60 
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tctgcggcgt gactggaggc ccag atg gtc ate atg ggc cag tgc tac tac 111 

Met Val He Met Gly Gin Cys Tyr Tyr 

1 5 

aac gag acc ate ggc ttc ttc tat aac aac agt ggc aaa gag etc age 159 
Asn Glu Thr He Gly Phe Phe Tyr Asn Asn Ser Gly Lys Glu Leu Ser 
10 15 20 25 

tec cac tgg egg ccc aag gat gtg gtc gtg gtg gca ctg ggg ctg acc 207 
Ser His Trp Arg Pro Lys Asp Val Val Val Val Ala Leu Gly Leu Thr 
30 35 40 

gtc age gtg ctg gtg ctg ctg acc aat ctg ctg gtc ata gca gec ate 255 
Val Ser Val Leu Val Leu Leu Thr Asn Leu Leu Val He Ala Ala He 
45 50 55 

gec tec aac cgc cgc ttc cac cag ccc ate tac tac ctg etc ggc aat 303 
Ala Ser Asn Arg Arg Phe His Gin Pro He Tyr Tyr Leu Leu Gly Asn 
60 65 70 

ctg gee gcg get gac etc ttc gcg ggc gtg gec tac etc ttc etc atg 351 
Leu Ala Ala Ala Asp Leu Phe Ala Gly Val Ala Tyr Leu Phe Leu Met 
75 80 85 

ttc cac act ggt ccc cgc aca gee cga ctt tea ctt gag ggc tgg ttc 399 
Phe His Thr Gly Pro Arg Thr Ala Arg Leu Ser Leu Glu Gly Trp Phe 
90 95 100 105 

ctg egg cag ggc ttg ctg gac aca age etc act gcg teg gtg gec aca 447 
Leu Arg Gin Gly Leu Leu Asp Thr Ser Leu Thr Ala Ser Val Ala Thr 
110 115 120 

ctg ctg gee ate gee gtg gag egg cac cgc agt gtg atg gec gtg cag 495 
Leu Leu TVla He Ala Val Glu Arg His Arg Ser Val Met Ala Val Gin 
125 130 135 

ctg cac age cgc ctg ccc cgt ggc cgc gtg gtc atg etc att gtg ggc 543 
Leu His Ser Arg Leu Pro Arg Gly Arg Val Val Met Leu lie Val Gly 
140 145 150 

gtg tgg gtg get gee ctg ggc ctg ggg ctg ctg cct gee cac tec tgg 591 
Val Trp Val Ala Ala Leu Gly Leu Gly Leu Leu Pro Ala His Ser Trp 
155 160 165 

cac tgc etc tgt gec ctg gac cgc tgc tea cgc atg gca ccc ctg etc 639 
His Cys Leu Cys Ala Leu Asp Arg Cys Ser Arg Met Ala Pro Leu Leu 
170 175 180 185 

age cgc tec tat ttg gee gtc tgg get ctg teg age ctg ctt gtc ttc 687 
Ser Arg Ser Tyr Leu Ala Val Trp Ala Leu Ser Ser Leu Leu Val Phe 
190 195 200 

ctg etc atg gtg get gtg tac acc cgc att ttc ttc tac gtg egg egg 735 
Leu Leu Met Val Ala Val Tyr Thr Arg He Phe Phe Tyr Val Arg Arg 
205 210 215 

cga gtg cag cgc atg gca gag cat gtc age tgc cac ccc cgc tac cga 783 
Arg Val Gin Arg Met Ala Glu His Val Ser Cys His Pro Arg Tyr Arg 
220 225 230 

gag acc acg etc age ctg gtc aag act gtt gtc ate ate ctg ggg gcg 831 
Glu Thr Thr Leu Ser Leu Val Lys Thr Val Val He He Leu Gly Ala 
235 240 245 
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ttc gtg gtc tgc tgg aca cca ggc cag gtg gta ctg etc ctg gat ggt 879 
Phe Val Val Cys Trp Thr Pro Gly Gin Val Val Leu Leu Leu Asp Gly 
250 255 260 265 

tta ggc tgt gag tec tgc aat gtc ctg get gta gaa aag tac ttc eta 927 
Leu Gly Cys Glu Ser Cys Asn Val Leu Ala Val Glu Lys Tyr Phe Leu 
'270 275 280 

ctg ttg gee gag gec aac tea ctg gtc aat get get gtg tac tct tgc 975 
Leu Leu Ala Glu Ala Asn Ser Leu Val Asn Ala Ala Val Tyr Ser Cys 
285 290 295 

cga gat get gag atg cgc cgc ace ttc cgc cgc ctt etc tgc tgc gcg 1023 
Arg Asp Ala Glu Met Arg Arg Thr Phe Arg Arg Leu Leu Cys Cys Ala 
300 305 310 

tgc etc cgc cag tec ace cgc gag tct gtc cac tat aca tec tct gee 1071 
Cys Leu Arg Gin Ser Thr Arg Glu Ser Val His Tyr Thr Ser Ser Ala 
315 320 325 

cag gga ggt gee age act cgc ate atg ctt ccc gag aac ggc cac cca 1119 
Gin Gly Gly Ala Ser Thr Arg lie Met Leu Pro Glu Asn Gly His Pro 
330 335 340 345 

ctg atg act cca ccc ttt age tac ctt gaa ctt cag egg tac gcg gca 1167 
Leu Met Thr Pro Pro Phe Ser Tyr Leu Glu Leu Gin Arg Tyr Ala Ala 
350 355 360 

age aac aaa tec aca gee cct gat gac ttg tgg gtg etc ctg get caa 1215 
Ser Asn Lys Ser Thr Ala Pro Asp Asp Leu Trp Val Leu Leu Ala Gin 
365 370 375 

ccc aac caa cag gac tgactgactg gcaggacaag gtctggcatg gcacagcacc 1270 
Pro Asn Gin Gin Asp 
380 



actgccaggc 


ctceccaggc 


acaccactct 


geccagggaa 


tgggggcttt 


gggtcatctc 


1330 


ccactgcctg 


ggggagtcag 


atggggtgca 


ggaatctggc 


tcttcagcca 


tctcaggttt 


1390 


agggggtttg 


taacagacat 


tattctgttt 


teactgegta 


tccttggtaa 


gccctgtgga 


1450 


ctggttcctg 


ctgtgtgatg 


ctgagggttt 


taaggtgggg 


agagataagg 


gctctctcgg 


1510 


gccatgctac 


ceggtatgae 


tgggtaatga 


ggacagactg 


tggacacccc 


atctacctga 


1570 


gtctgattct 


ttagcagcag 


agactgaggg 


gtgcagagtg 


tgagctggga 


aaggtttgtg 


1630 


gctccttgca 


gcctccaggg 


actggcctgt 


ccccaataga 


attgaagcag tecaegggga 


1690 


ggggatgata 


caaggagtaa 


acctttcttt 


acactcaaaa 


aaaa 




1734 



<210> 7 
<211> 1059 
<212> DNA 
<213> Rattus sp. 

<220> 

<221> CDS 

<222> (1) . . (1056) 
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<400> 7 

atg ggc ggt tta tac tea gag tac etc aat cct gag aag gtt cag gaa 48 
Met Gly Gly Leu Tyr Ser Glu Tyr Leu Asn Pro Glu Lys Val Gin Glu 
15 10 15 

cac tac aat tac acc aag gag acg ctg gac atg cag gag acg ccc tec 96 
His Tyr Asn Tyr Thr Lys Glu Thr Leu Asp Met Gin Glu Thr Pro Ser 
20 25 30 

cgc aag gtg gec tec gec ttc ate ate att tta tgc tgt gec ate gtg 144 
Arg Lys Val Ala Ser Ala Phe lie lie lie Leu Cys Cys Ala lie Val 
35 40 45 

gtg gag aac ctt ctg gtg eta ate gca gtg gec agg aac age aag ttc 192 
Val Glu Asn Leu Leu Val Leu lie Ala Val Ala Arg Asn Ser Lys Phe 
50 55 60 

cac tea gec atg tac ctg ttc etc ggc aac ctg gca gec tec gac ctg 240 
His Ser Ala Met Tyr Leu Phe Leu Gly Asn Leu Ala Ala Ser Asp Leu 
65 70 75 80 

ctg gca ggc gtg gec ttc gtg gee aac acc ttg etc tec gga cct gtc 288 
Leu Ala Gly Val Ala Phe Val Ala Asn Thr Leu Leu Ser Gly Pro Val 
85 90 95 

acc ctg tec tta act ccc ttg cag tgg ttt gee cga gag ggt tea gee 336 
Thr Leu Ser Leu Thr Pro Leu Gin Trp Phe Ala Arg Glu Gly Ser Ala 
100 105 110 

ttc ate acg etc tct gec teg gtc ttc age etc ctg gec att gee ate 384 
Phe lie Thr Leu Ser Ala Ser Val Phe Ser Leu Leu Ala lie Ala lie 
115 120 125 

gag aga caa gtg gee ate gee aag gtc aag etc tac ggc agt gac aaa 432 
Glu Arg Gin Val Ala lie Ala Lys Val Lys Leu Tyr Gly Ser Asp Lys 
130 135 140 

age tgt cga atg ttg atg etc att ggg gec tct tgg ctg ata teg ctg 480 
Ser Cys Arg Met Leu Met Leu lie Gly Ala Ser Trp Leu He Ser Leu 
145 150 155 160 

att ctg ggt ggc ttg ccc ate ctg ggc tgg aat tgt ctg gac cat ctg 528 
He Leu Gly Gly Leu Pro He Leu Gly Trp Asn Cys Leu Asp His Leu 
165 170- 175 

gag get tgc tec act gtg ctg ccc etc tat get aag cac tat gtg etc 576 
Glu Ala Cys Ser Thr Val Leu Pro Leu Tyr Ala Lys His Tyr Val Leu 
180 185 190 

tgc gtg gtc acc ate ttc tct gtc ate tta ctg get ate gtg gec ttg 624 
Cys Val Val Thr He Phe Ser Val He Leu Leu Ala He Val Ala Leu 
195 200 205 

tac gtc cga ate tac ttc gta gtc cgc tea age cat gcg gac gtt get 672 
Tyr Val Arg He Tyr Phe Val Val Arg Ser Ser His Ala Asp Val Ala 
210 215 220 

ggt cct cag acg ctg gee ctg etc aag aca gtc acc ate gta ctg ggt 720 
Gly Pro Gin Thr Leu Ala Leu Leu Lys Thr Val Thr He Val Leu Gly 
225 230 235 240 
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gtt ttc ate ate tgc tgg ctg ccg get ttt age ate ctt etc tta gac 768 
Val Phe lie lie Cys Trp Leu Pro Ala Phe Ser lie Leu Leu Leu Asp 
245 250 255 

tct ace tgt ccc g'tc egg gee tgt cct gtc etc tac aaa gee cat tat 816 
Ser Thr Cys Pro Val Arg Ala Cys Pro Val Leu Tyr Lys Ala His Tyr 
260 265 270 

ttc ttt gee ttc gee acc etc aac tct ctg etc aac cct gtc ate tat 864 
Phe Phe Ala Phe Ala Thr Leu Asn Ser Leu Leu Asn Pro Val lie Tyr 
275 280 285 

aca tgg cgt age egg gac ctt egg agg gag gta ctg agg ccc ctg ctg 912 
Thr Trp Arg Ser Arg Asp Leu Arg Arg Glu Val Leu Arg Pro Leu Leu 
290 295 300 

tgc tgg egg cag ggg aag gga gca aca ggg cgc aga ggt ggg aac cct 960 
Cys Trp Arg Gin Gly Lys Gly Ala Thr Gly Arg Arg Gly Gly Asn Pro 
305 310 315 320 

ggt cac cga etc ctg ccc etc cgc age tec age tec ctg gag aga ggc 1008 
Gly His Arg Leu Leu Pro Leu Arg Ser Ser Ser Ser Leu Glu Arg Gly 
325 330 335 

ttg cat atg cct aca teg cca aca ttt ctg gag ggc aac aca gtg gtc 1056 
Leu His Met Pro Thr Ser Pro Thr Phe Leu Glu Gly Asn Thr Val Val 
340 345 350 

tga 1059 



<210> 8 
<211> 1122 
<212> DNA 

<213> Murinae gen. sp. 

<220> 

<221> CDS 

<222> (1) . . (1119) 

<400> 8 

atg gee tgt aac age aca ccc atg ggg act tac gaa cat ctg ctg ctg 48 

Met Ala Cys Asn Ser Thr Pro Met Gly Thr Tyr Glu His Leu Leu Leu 

1 5 10 15 

aat gtg age aac act ttg gac cct ggg gac acc cca ctg tct gca ccg 96 
Asn Val Ser Asn Thr Leu Asp Pro Gly Asp Thr Pro Leu Ser Ala Pro 
20 25 30 

etc agg ate teg ctg gca ata atg atg ctg ctg atg ate gtg gta gga 144 
Leu Arg lie Ser Leu Ala lie Met Met Leu Leu Met lie Val Val Gly 
35 40 45 

ttc ctt ggc aac acg gtg gtc tgc ate ate gtg tac cag agg cca gee 192 
Phe Leu Gly Asn Thr Val Val Cys He He Val Tyr Gin Arg Pro Ala 
50 55 60 

atg cgt tea get ate aac ctg ctg ctg gee acc ttg gee ttc tec gac 240 
Met Arg Ser Ala He Asn Leu Leu Leu Ala Thr Leu Ala Phe Ser Asp 
65 70 75 80 
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ate atg ctg tct tta tgc tgc atg cca ttc acg gec ate acc etc ate 288 
lie Met Leu Ser Leu Cys Cys Met Pro Phe Thr Ala He Thr Leu He 
85 90 95 

act gtt cgc tgg cat ttc ggg gac cac ttt tgt egg ctt tea get act 336 
Thr Val Arg Trp His Phe Gly Asp His Phe Cys Arg Leu Ser Ala Thr 
100 105 110 

etc tat tgg ttt ttt gtc eta gag ggc gtg gee ate ctg etc ate att 384 
Leu Tyr Trp Phe Phe Val Leu Glu Gly Val Ala He Leu Leu He He 
115 120 125 

age gtg gac egg ttt etc ate ate gtg cag cgt cag gac aag ctg aac 432 
Ser Val Asp Arg Phe Leu He He Val Gin Arg Gin Asp Lys Leu Asn 
130 135 140 

cca cgc agg get aag atg ate ate gcg gee tec tgg gtg ctg tct ttc 480 
Pro Arg Arg Ala Lys Met He He Ala Ala Ser Trp Val Leu Ser Phe 
145 150 155 160 

tgc ate tct gcg ccc tec ttc act ggc tgg acg ttc atg gag gtg cct 528 
Cys He Ser Ala Pro Ser Phe Thr Gly Trp Thr Phe Met Glu Val Pro 
165 170 175 

get cga gee cca cag tgc gtg eta ggc tac act gag ttc cca get gaa 576 
Ala Arg Ala Pro Gin Cys Val Leu Gly Tyr Thr Glu Phe Pro Ala Glu 
180 185 190 

cgc gee tat gta gtg aca ctg gtg gtg gca gtg ttc ttt get ccc ttc 624 
Arg Ala Tyr Val Val Thr Leu Val Val Ala Val Phe Phe Ala Pro Phe 
195 200 205 

ggc gtc atg ttg tgc tec tat ctg tgc ate etc aat acg gtg egg aag 672 
Gly Val Met Leu Cys Ser Tyr Leu Cys He Leu Asn Thr Val Arg Lys 
210 215 220 

aac get gtc cgt gtg cac aac cag teg gac age ctg gac etc aga cag 720 
Asn Ala Val Arg Val His Asn Gin Ser Asp Ser Leu Asp Leu Arg Gin 
225 230 235 240 

ctg acc gga get ggc ctg aga cgt etc aga egg cag cag cag cag gee 768 
Leu Thr Gly Ala Gly Leu Arg Arg Leu Arg Arg Gin Gin Gin Gin Ala 
245 250 255 

age ctg gac ctg agt ttc aaa acc aag gee ttc acc acc ate etc ate 816 
Ser Leu Asp Leu Ser Phe Lys Thr Lys Ala Phe Thr Thr He Leu He 
260 265 270 

etc ttc gtg ggc ttt tea etc tgc tgg ctg cca cac tea gtc tac age 864 
Leu Phe Val Gly Phe Ser Leu Cys Trp Leu Pro His Ser Val Tyr Ser 
275 280 285 

ctg ctg tct gcg ttc age egg egg ttc tat tac age gec tec ttc tac 912 
Leu Leu Ser Ala Phe Ser Arg Arg Phe Tyr Tyr Ser Ala Ser Phe Tyr 
290 295 300 

acc acc age aca tgc gtc ctg tgg etc agt tac etc aag tct gtt ttc 960 
Thr Thr Ser Thr Cys Val Leu Trp Leu Ser Tyr Leu Lys Ser Val Phe 
305 310 315 320 

aac ccc ate gtc tac tgc tgg agg ate aag aaa ttc cgc gag gee tgc 1008 
Asn Pro He Val Tyr Cys Trp Arg He Lys Lys Phe Arg Glu Ala Cys 
325 330 335 
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ata gag ttg ctt ccc cac act ttc caa ate etc cct aaa gtg cct gag 1056 

He Glu Leu Leu Pro His Thr Phe Gin He Leu Pro Lys Val Pro Glu 
340 345 350 

egg ate cag agg aaa ate cag cca age acc ate tat gtg tgc aac gaa 1104 

Arg He Gin Arg Lys He Gin Pro Ser Thr He Tyr Val Cys Asn Glu 
355 360 365 

aac caa tec get gtc tag 1122 

Asn Gin Ser Ala Val 
370 



<210> 9 
<211> 373 
<212> PRT 
<213> Mus sp. 

<400> 9 

Met Ala Cys Asn Ser Thr Pro Met Gly Thr Tyr Glu His Leu Leu Leu 
15 10 15 

Asn Val Ser Asn Thr Leu Asp Pro Gly Asp Thr Pro Leu Ser Ala Pro 
20 25 30 

Leu Arg He Ser Leu Ala He Met Met Leu Leu Met He Val Val Gly 
35 40 45 

Phe Leu Gly Asn Thr Val Val Cys He He Val Tyr Gin Arg Pro Ala 
50 55 60 

Met Arg Ser Ala He Asn Leu Leu Leu Ala Thr Leu Ala Phe Ser Asp 
65 70 75 80 

He Met Leu Ser Leu Cys Cys Met Pro Phe Thr Ala He Thr Leu He 
85 90 95 

Thr Val Arg Trp His Phe Gly Asp His Phe Cys Arg Leu Ser Ala Thr 
100 105 110 

Leu Tyr Trp Phe Phe Val Leu Glu Gly Val Ala He Leu Leu He He 
115 120 125 

Ser Val Asp Arg Phe Leu He He Val Gin Arg Gin Asp Lys Leu Asn 
.130 135 140 

Pro Arg Arg Ala Lys Met He He Ala Ala Ser Trp Val Leu Ser Phe 
145 150 155 160 

Cys He Ser Ala Pro Ser Phe Thr Gly Trp Thr Phe Met Glu Val Pro 
165 170 175 

Ala Arg Ala Pro Gin Cys Val Leu Gly Tyr Thr Glu Phe Pro Ala Glu 
180 185 190 

Arg Ala Tyr Val Val Thr Leu Val Val Ala Val Phe Phe Ala Pro Phe 
195 200 205 

Gly Val Met Leu Cys Ser Tyr Leu Cys He Leu Asn Thr Val Arg Lys 
210 215 220 
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Asn Ala Val Arg Val His Asn Gin Ser Asp Ser Leu Asp Leu Arg Gin 
225 230 235 240 

Leu Thr Gly Ala Gly Leu Arg Arg Leu Arg Arg Gin Gin Gin Gin Ala 
245 250 255 

Ser Leu Asp Leu Ser Phe Lys Thr Lys Ala Phe Thr Thr lie Leu lie 
260 265 270 

Leu Phe Val Gly Phe Ser Leu Cys Trp Leu Pro His Ser Val Tyr Ser 
275 280 285 

Leu Leu Ser Ala Phe Ser Arg Arg Phe Tyr Tyr Ser Ala Ser Phe Tyr 
290 295 300 

Thr Thr Ser Thr Cys Val Leu Trp Leu Ser Tyr Leu Lys Ser Val Phe 
305 310 315 320 

Asn Pro lie Val Tyr Cys Trp Arg lie Lys Lys Phe Arg Glu Ala Cys 
325 330 335 

lie Glu Leu Leu Pro His Thr Phe Gin lie Leu Pro Lys Val Pro Glu 
340 345 350 

Arg lie Gin Arg Lys lie Gin Pro Ser Thr lie Tyr Val Cys Asn Glu 
355 360 365 

Asn Gin Ser Ala Val 
370 



<210> 10 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 10 

gcgcgggatc caccatgggg cccaccagcg tcccg 35 



<210> 11 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 11 

gcgcggtcga cggaagaaga gttgacgttt cc 32 

<210> 12 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
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<400> 12 

gcgcgggatc caccatggca actgccctcc eg 



32 



<210> 13 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 13 

gegeggtega etcagttgea gaagatccca ttc 33 



<210> 14 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<210> 15 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 15 

agttcactcg agtcagtcct gttggttggg ttg 33 



<210> 16 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<210> 17 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 17 

gegeggtega ctcagaccac tgtgttgccc 30 



<400> 14 

ateageggat ccaccatggt catcatgggc cagt 



34 



<400> 16 

gcgcgggatc caccatgggc ggtttatact cagag 



35 
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<210> 18 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 18 

gcgctctaga ccaccatggc ctgtaacagc acaccc 36 



<210> 19 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 19 

gcgcgtcgac ctagacagcg gattggtttt eg 32 
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